(57) Abstract

The present invention provides compositions and methods for producing a heterologous protein of interest by inserting a copy of a
gene encoding the heterologous protein of interest into the chromosome of a host cell, such as E. coli. A chromosomal transfer DNA (a
circular, non-self-replicating DNA) is used to integrate the gene encoding the heterologous protein of interest into the host cell chromosome.
The chromosomal transfer DNA comprises at least one selectable marker and may optionally include repeated DNA sequences flanking the
selectable marker, facilitating chromosomal amplification of the integrated DNA. The gene encoding the protein of interest may be expressed
after integration into the chromosome of the host cell; selection for chromosomal amplification may be performed prior to expression of
the gene.

CHROMOSOMAL EXPRESSION OF HETEROLOGOUS
GENES IN BACTERIAL CELLS

TECHNICAL FIELD 

This invention is related to the field of expression of heterologous genes
in bacteria.

BACKGROUND ART 

Genetic engineering has made it possible to produce large amounts of 
heterologous proteins or polypeptides in bacterial cells by means of recombinant 
expression systems, especially by expression in such prokaryotes as Escherichia coli (E.
coli).

The expressed heterologous proteins may be of mammalian, other 
eukaryotic, viral, bacterial, cyanobacterial, archaebacterial, or synthetic origin. 

Unlike native bacterial proteins, which can often be efficiently
accumulated within a bacterial cell even when encoded by a single chromosomal gene
copy, there are no published reports to date of heterologous proteins being successfully
accumulated within bacterial cells to levels exceeding 0.1% of total cell protein when
expressed from a single chromosomal gene location.

0.1% of total cell protein (150 micrograms protein per trillion bacterial
cells) is chosen as a practical measure of successful accumulation of protein because it
approximately defines the lower limits of (a) economically significant accumulation of a
desired protein by contemporary recombinant bacterial production standards, and (b)
visual detection of a protein band by Coomassie-stained polyacrylamide gel analysis of
whole bacterial cell extracts.
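
The threshold arithmetic can be made concrete with a short sketch. It assumes that the mass figure in the preceding paragraph reads 150 micrograms per trillion cells and back-calculates the total cell protein that this implies; the function names are illustrative and are not part of the disclosure.

```python
# Sketch of the 0.1% accumulation threshold described above.
# Assumption: the figure in the text is read as 150 micrograms of protein per
# 10**12 (one trillion) bacterial cells. All names here are illustrative only.

THRESHOLD_FRACTION = 0.001            # 0.1% of total cell protein
THRESHOLD_UG_PER_1E12_CELLS = 150.0   # micrograms per 10**12 cells


def implied_total_protein_ug() -> float:
    """Total cell protein (micrograms per 10**12 cells) implied by the two figures."""
    return THRESHOLD_UG_PER_1E12_CELLS / THRESHOLD_FRACTION


def percent_of_total(protein_ug: float) -> float:
    """Express a yield (micrograms per 10**12 cells) as a percentage of total cell protein."""
    return 100.0 * protein_ug / implied_total_protein_ug()


def meets_threshold(protein_ug: float) -> bool:
    """True when the yield reaches the 0.1% practical threshold."""
    return protein_ug >= THRESHOLD_UG_PER_1E12_CELLS
```

Under these assumptions the implied total is about 150 milligrams of protein per trillion cells, so an accumulation of 20% of total cell protein would correspond to roughly 30 milligrams on the same basis.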

The relatively poor performance of non-bacterial genes when expressed in
bacterial cells, even when placed under the control of the strongest known bacterial
promoters, has been generally attributed to poor translation of the non-bacterial mRNAs
and rapid degradation of newly synthesized non-bacterial proteins. It has almost
universally been assumed that, in order to achieve successful accumulation of
non-bacterial or heterologous proteins in bacterial cells, the genes encoding the
heterologous proteins must be located on multicopy plasmid vectors.

A gene carried on one of the multicopy plasmids commonly used for
cloning and expressing genes encoding heterologous proteins in E. coli usually has a
copy number of more than 20 copies/cell. Even low copy number plasmids (e.g.,
pACYC177 and pLG339) generally exist at 6-10 copies per cell. One disadvantage
imposed by plasmid gene dosages is that the expression of even minute amounts of some
foreign proteins can kill host cells (see Meth. Enzymol. 185:63-65, ed. D. Goeddel,
1990). For this reason, it would be advantageous to reliably limit the copy number of
genes encoding such toxic gene products, such as by integrating the gene into the
bacterial chromosome at one or a small number of copies per cell. For example, such a
system would allow one to make more representative cDNA expression libraries in
bacterial hosts if the high-copy expression of one or more of the cDNAs in the library
could kill the bacterial host or cause it to grow poorly.

Chromosomal integration of genes encoding heterologous polypeptides
would also be advantageous as an alternative means for expression of heterologous
proteins in bacterial host cells. Multicopy vectors are often unstable and require the use
of antibiotics in the growth medium for maintenance. Present methods of integrating
foreign genes into the bacterial chromosome suffer from inefficiency, the inability to
control the site of integration of the foreign gene, and/or the inability to control the copy
number of the integrated gene. Most importantly, all efforts to date to create
recombinant DNA constructs on the bacterial chromosome, wherein a bacterial promoter
is fused to a heterologous gene, have involved the creation of viral or plasmid
intermediates carrying the construct. Because such intermediates replicate at high copy
number, they may be difficult or even impossible to recover in cases where the foreign
gene product is toxic to the bacterial cell. Expression of the encoded gene, even at low
levels, may be toxic to the host cells, due to the high copy number of these intermediates,
which effectively multiplies the level of expression.

Previous methods for achieving the integration of heterologous genes into
the chromosome of a bacterial host include the use of phage lambda vectors. The phage
DNA in circular form is inserted linearly into the bacterial chromosome by a single
site-specific recombination between a phage attachment site (attP), 240 bases long, and a
bacterial attachment site (attB), only 25 bases long. The two sites have 15 bases in
common. This site-specific recombination is catalyzed by a special integrase, specified
by the phage gene INT (Virology, pp. 56-57 (Lippincott, 2nd ed., R. Dulbecco and H.
Ginsberg, eds., Philadelphia, PA, 1985)).

Phage vectors which are INT- can be integrated into the chromosome in a
normal fashion as long as integrase is supplied in trans, e.g., by an INT+ helper phage
(see, e.g., Borck et al. (1976) Molec. Gen. Genet. 146:199-207).

Phage vectors which are both att- and INT- can likewise be integrated into
the bacterial chromosome as double lysogens by using att+ INT+ helper phage. Double
lysogens are formed by linkage of the prophages at the bacterial attachment site and are
integrated into the chromosome by general bacterial recombination between homologous
sequences on the defective phage and on the helper phage (see, e.g., Struhl et al. (1976)
Proc. Natl. Acad. Sci. USA 73:1471-1475). Similarly, it is also possible to integrate
non-replicating ColE1 replicons into the genome of polA strains of E. coli by means of
recombination between the host chromosome and homologous sequences carried by the
plasmid vector (Greener and Hill (1980) J. Bacteriol. 144:312-321).

More recently, systems have been specifically designed for the integration
of foreign genes into a bacterial host chromosome. For example, U.S. Patent No.
5,395,763 (Weinberg et al.) discloses a chromosomal expression vector for the
expression of heterologous genes. This vector was created utilizing a multicopy number
plasmid intermediate, into which the gene of interest is cloned, placing the gene in
operable linkage with the bacteriophage middle promoter, Pm. This plasmid
intermediate, which comprises a defective Mu genome (lacking the genes necessary for
the formation of phage particles), is introduced into a packaging strain to produce
infectious Mu particles, which are then used to introduce the vector into host cells and
integrate the vector into the host cell genome. This vector system is amplifiable once
integrated into the host cell genome, but the mechanism of amplification (replicative
transposition) is normally toxic to the host cell, due to integration of the replicating
prophage into essential host cell genes (Neidhardt et al., ESCHERICHIA COLI AND
SALMONELLA TYPHIMURIUM: MOLECULAR AND CELLULAR BIOLOGY (American Society for
Microbiology, Neidhardt et al. eds., Washington, D.C., 1987)). Because the amplification
of this integrated prophage is normally toxic, it is very difficult to obtain and propagate a
host cell strain carrying the amplified integrated DNA. This then requires that the gene
be amplified each instance that protein production is desired.

Diederich et al. ((1992) "New plasmid vectors for integration into the λ
attachment site attB of the Escherichia coli chromosome", Plasmid 28:14-24) also
disclose a system for introducing a gene onto the chromosome of a bacterial host cell.
This system utilizes a set of multicopy plasmid vectors which can be integrated into a
bacterial chromosome via a phage lambda attachment site. A DNA sequence encoding a
promoter operably linked to a gene of interest is cloned into one of the described
multicopy number plasmid vectors, the plasmid's origin of replication is removed by
restriction enzymes, and the resulting DNA is recircularized and transferred to a host
cell, where it integrates into the chromosome.

These new gene transfer systems suffer from the same defect as earlier
systems. Both USP 5,395,763 (Weinberg et al.) and Diederich et al. require that the gene
of interest be cloned into a multicopy number plasmid while in an operable configuration
during the construction of the transfer DNA. The configuration of this multicopy
number plasmid makes expression of toxic foreign genes difficult, if not impossible,
because the (toxic) gene of interest will be expressed as the multicopy number plasmid is
propagated.

Accordingly, there is a need for a method of producing heterologous
proteins which can produce large amounts of protein and which minimizes any toxic
effect of the heterologous protein to host cells during construction of the producing
strain. Applicants have shown surprisingly high protein accumulation (approximately
20% of total cell protein) from expression of low (approximately two) copies of the gene
encoding the heterologous protein as shown in Example 2.

SUMMARY OF THE INVENTION

The present invention provides methods and compositions for production
of heterologous proteins in bacterial host cells such as E. coli by integrating a
chromosomal transfer DNA (a circular, non-self-replicating DNA) into the chromosome
of a host cell. The chromosomal transfer DNA comprises one or more copies of a gene
encoding the heterologous protein of interest.

The present invention, therefore, provides a method for producing a 
heterologous protein of interest, comprising: 

integrating a chromosomal transfer DNA into the chromosome of a host
cell such that chromosomal amplification of the integrated DNA is facilitated, the
chromosomal transfer DNA comprising at least one copy of a gene encoding a
heterologous protein of interest and a selectable marker, and

expressing the gene encoding the heterologous protein of interest,
wherein the gene was at no time operably linked to a promoter functional in the host cell
in a multicopy number plasmid during the construction of the transfer DNA, and

wherein the heterologous protein of interest accumulates to a level of at
least 0.1% of total cell protein.

The chromosomal transfer DNA may optionally comprise a promoter
operably linked to the gene encoding the heterologous protein of interest, wherein the
operable linkage is created by circularization of the chromosomal transfer DNA.

Optionally, the chromosomal transfer DNA may further comprise 
duplicate DNA flanking the selectable marker. The duplicate DNA may optionally 
comprise copies of the gene encoding the heterologous protein of interest operably linked 
to a promoter. 

The methods for expression of heterologous proteins may optionally
include the step of selecting for chromosomal amplification.

The invention also provides methods for producing a chromosomal
transfer DNA, comprising ligating together fragments from a first and a second plasmid
vector:

the first plasmid vector comprising a first origin of replication, and a first
gene encoding a heterologous protein of interest wherein the first gene is not operably
linked to a promoter and a first copy of a duplicate DNA;

the second plasmid vector comprising a second origin of replication, and a 
first promoter and a second copy of a duplicate DNA; 

wherein the origins of replication and the promoter function in the host
cell, and wherein either said first plasmid or said second plasmid comprises a selectable
marker.

Optionally, the first plasmid may further comprise a second promoter not
operably linked to the first gene encoding the heterologous protein of interest and the
second plasmid may further comprise a second copy of the gene encoding the
heterologous protein of interest not operably linked to the first promoter.

Also provided are chromosomal transfer DNAs for use in production of
heterologous proteins of interest, comprising:

a non-bacterial gene of interest operably linked to a promoter functional in
a host cell; and

a selectable marker flanked by duplicate DNA,

wherein said gene encoding a heterologous protein is at no time operably
linked to a promoter functional in a host cell on a multicopy number plasmid vector
during the construction of the transfer DNA.

Optionally, the chromosomal transfer DNA may further comprise two or
more copies of the gene encoding the non-bacterial protein of interest, wherein the copies
of the gene flank the selectable marker.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows steps in the in vitro formation of a chromosomal transfer
DNA, a DNA circle which lacks an origin of replication (and thus is incapable of
self-replication) and is suitable for integration of a foreign gene into the bacterial
chromosome. Until the chromosomal transfer DNA is formed, the foreign gene to be
expressed (here an IGF-I fusion gene) is separated from a functional bacterial promoter
(here the T7 promoter).

Figure 2 shows a chromosomal transfer DNA formed from the ligation of
two DNA fragments. One of the fragments contains a fusion gene comprising sequences
encoding E. coli DsbA, yeast ubiquitin (beginning with a Met), and human insulin-like
growth factor I ("dsbA-ubi-IGF") (not beginning with a Met), as discussed in co-owned,
co-pending U.S. patent application no. 08/100,744, filed August 2, 1993. The other
DNA fragment contains a T7 promoter. Both the chromosomal transfer DNA and the
bacterial chromosome contain a recombination site from phage lambda, attP. The
chromosomal transfer DNA is transformed into E. coli strain B1384, which makes
integrase (INT) under the control of the trp promoter (P-trp). Integrase catalyzes
site-specific integration of the chromosomal transfer DNA into the bacterial chromosome
at the att site. The trp promoter can be induced during transformation by adding 1 mM
indole acrylic acid (IAA) to the medium. Cells with integrated chromosomal transfer
DNA sequences are resistant to chloramphenicol (CAM-r, 10 µg/ml).

Figure 3 shows a B1384 chromosomal integrant resulting from the
process described in Figure 2. The integration can be confirmed by amplifying host
chromosomal DNA by PCR with various primer sets (e.g., UBUF x IGFR, 1243 x
T7REV, or TRPPF x 1239), digesting the amplified fragments with the appropriate
restriction enzyme (SacII, HincII, or BamHI, respectively), and sizing the products by
gel electrophoresis.

Figure 4 shows a Western blot of whole cell lysates of chloramphenicol
resistant W3110DE3 transductants. Also included are protein size markers (far left lane)
and IGF fusion protein (control).

Figure 5 shows a Western blot of whole cell lysates of kanamycin
resistant transductants.

Figures 6-9 show diagrammatically the general strategy for construction
of chromosomal transfer DNAs. Figure 6 shows a chromosomal transfer DNA that
comprises a single copy of the gene encoding the heterologous protein of interest and
two copies of a second gene which flank the selectable markers, facilitating
chromosomal amplification after integration of the chromosomal transfer DNA. Figure 7
shows the "double cassette" system utilized for expression of heterologous proteins in
Examples 2 through 6 and Example 8. This embodiment of the chromosomal transfer
DNA comprises two copies of the gene encoding the heterologous protein of interest
flanking the selectable markers, facilitating chromosomal amplification of the integrated
DNA. Figures 8 and 9 show alternate embodiments of "promoter-less" chromosomal
transfer DNAs. These embodiments utilize a DNA sequence homologous to a segment
of the host cell chromosome. Integration of promoter-less chromosomal transfer DNAs
results in formation of an operable linkage between a host cell promoter and the gene
encoding the heterologous protein of interest and the creation of duplicate DNA
sequences flanking the selectable markers.

Figures 10-13 show the plasmid genealogy of chromosomal transfer
DNAs.

Figure 14 shows the strategy for construction of the two DNA sources
used in the double cassette system.

Figures 15 and 16 show the strategy used to construct chromosomal
transfer DNAs for integration and expression of the yeast ubiquitin hydrolase gene.

Figure 17 shows the strategy for constructing the chromosomal transfer
DNA used to integrate and express a gene encoding a DsbA::ubiquitin::IGF-I fusion
protein.

Figure 18 shows the strategy for constructing the chromosomal transfer 
DNA used to integrate and express a gene encoding a DsbA::2A::IGFBP-3 fusion 
protein. 

Figures 19 and 20 show the strategies used to construct chromosomal 
transfer DNAs used to integrate and express genes coding for DsbA::2A::IGF-I (Figure 
19) and DsbA::3C::IGF-I (Figure 20) fusion proteins. 

Figure 21 shows the strategy used to construct the chromosomal transfer
DNA used to integrate and express the gene encoding a DsbA::ubiquitin::TGF-β2 fusion
protein.

Figure 22 shows the strategy used to construct the chromosomal transfer
DNA used to integrate and express a gene encoding a DsbA::3C::IGFBP-3 fusion
protein.

Figure 23 shows a Coomassie blue-stained SDS-PAGE gel of whole cell
lysates of isolates expressing IGF-I fusion proteins. c49222, c49258#46, and c53063
express a DsbA::ubiquitin::IGF-I fusion protein (left arrow), which is easily visible.
Surprisingly, this high level of expression is seen in c49222 and c49258#46, which were
not amplified (i.e., there was no selection for chromosomal amplification of the integrated
DNA). c57264#5 and c57264#28 express a DsbA::3C::IGF-I fusion protein while
c57265#44 and c57265#54 express a DsbA::2A::IGF-I fusion protein. Again, the
expressed fusion protein is easily visible. Densitometric analysis of this gel indicates
that all of the isolates accumulate protein in excess of 19% of total cell protein (average
protein accumulation is 25.7% of total cell protein).

Figure 24 shows a Southern blot of chromosomal DNA isolated from
c49222, c49258#46, c53063, c57264#5, c57264#28, c57265#44, and c57265#54. The
blot was probed with a DNA fragment encoding ubiquitin fused to IGF-I. The higher
molecular weight band in each lane represents a single copy of the integrated IGF-I
fusion protein gene in each isolate. The lower molecular weight band also represents the
integrated IGF-I fusion protein gene, but this fragment can be amplified by chromosomal
amplification. Isolates c53063, c57264#5, c57264#28, c57265#44, and c57265#54 have
clearly been amplified, showing about 3 to 5 fold amplification.
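
One way to turn such a blot into a number is to compare the two bands within each lane. The sketch below is an illustration only: it assumes that band intensity scales linearly with copy number and that the higher molecular weight (single-copy) band serves as an internal reference. The intensity values are hypothetical and are not data from Figure 24.

```python
# Hypothetical sketch: estimating fold amplification from one Southern blot lane.
# Assumptions (not from the source): signal is linear in copy number, and the
# higher molecular weight band is a single-copy internal reference.


def fold_amplification(amplifiable_band: float, single_copy_band: float) -> float:
    """Ratio of the amplifiable (lower) band to the single-copy (upper) band."""
    if single_copy_band <= 0:
        raise ValueError("single-copy reference band must have positive intensity")
    if amplifiable_band < 0:
        raise ValueError("band intensity cannot be negative")
    return amplifiable_band / single_copy_band


# Made-up densitometry readings (arbitrary units): (lower band, upper band).
lanes = {
    "unamplified (hypothetical)": (100.0, 100.0),
    "amplified (hypothetical)": (400.0, 100.0),
}
ratios = {name: fold_amplification(low, high) for name, (low, high) in lanes.items()}
```

Normalizing within a lane, rather than across lanes, makes the estimate less sensitive to differences in the amount of DNA loaded per lane.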

Figure 25 shows Coomassie blue-stained SDS-PAGE gels showing protein
accumulation in isolates carrying integrated genes encoding IGFBP-3 fusion proteins.
A) shows protein accumulation in an isolate expressing a DsbA::2A::IGFBP-3 fusion
protein. The right lane shows protein expression after induction of T7 RNA polymerase
by addition of IPTG to the culture medium. B) shows protein accumulation in an isolate
expressing a DsbA::3C::IGFBP-3 fusion protein. As in Figure 23, the bands representing
the fusion protein are easily visible. Densitometric scanning of these gels found that the
accumulated protein represented 22.6% in Panel A, and the two isolates in Panel B
accumulated 33 .% and 28.2% of total cell protein (left to right, respectively).

Figure 26 shows a Coomassie blue-stained SDS-PAGE gel showing
protein accumulation from host cells expressing a gene encoding
DsbA::ubiquitin::TGF-β2. M indicates molecular weight markers and C indicates a
positive control. The two Plasmid lanes (Lanes 1 and 2) are used as a standard to
compare protein accumulation from multicopy number plasmid vectors to protein
accumulation from genes integrated into the chromosome. Lanes 3 and 4 are whole cell
lysates of isolates which were negative for T7 RNA polymerase activity when streaked
against phage 4107. Densitometric analysis of this gel showed that the plasmid strain
accumulated protein to 26.4% of total cell protein. Protein accumulation was measured
for isolates 48, 56, 59, 65 and 66, and showed protein accumulation to 36.7%, 33.3%,
32.1%, 29.5%, and 26.7%, respectively.

WO 96/40722 PCT/US96/09006

MODES FOR CARRYING OUT THE INVENTION

The present invention resides in (a) the creation of an operable linkage
between a promoter and a gene encoding a heterologous protein of interest with the
linkage being formed either during the construction of a chromosomal transfer DNA or
as a result of its integration into the host cell chromosome and (b) the simultaneous
creation of a means for the appropriate chromosomal amplification of the integrated gene
of interest.

In the preferred embodiments, the creation of the chromosomal transfer
DNA simultaneously achieves two goals: (1) the operable linkage of the promoter and
the gene of interest and (2) the positioning of duplicate DNA sequences flanking a
selectable marker (which can function as a means to facilitate the amplification of the
chromosomal transfer DNA). Another embodiment creates the operable linkage between
the gene and the promoter during creation of the chromosomal transfer DNA, while the
means for chromosomal amplification (duplicate DNA sequences flanking the
chromosomal transfer DNA) is created as a result of the integration.

Other methods can achieve either or both of these results by integration of
a chromosomal transfer DNA into a suitable site on the chromosome. For example,
integration of a gene of interest near a promoter on the bacterial chromosome can be
designed to result in an operable linkage (for example, by integrating a chromosomal
transfer DNA into an operon on the host cell chromosome). The site of integration or
sequences adjacent to the site of integration may facilitate amplification (e.g., where the
site is located in a transposable element, by providing duplicate DNA sequences, or even
by providing a region of DNA sequence homologous to a portion of the chromosomal
transfer DNA, thus providing duplicate DNA sequences).

The present invention employs "chromosomal transfer DNA" which may
be used to simply, efficiently, and reliably insert a copy of a heterologous gene into the
chromosome of a host cell, e.g., E. coli. A chromosomal transfer DNA is a circular DNA
comprising one or more copies of a gene encoding a heterologous protein of interest, a
selectable marker (e.g., an antibiotic resistance gene), a recombination site (e.g., a
site-specific recombination site such as lambda attP or attB or a DNA sequence
homologous to a segment on the host cell chromosome), and means for facilitating the
amplification of the chromosomal transfer DNA following recombination into the host
cell chromosome, and lacking an origin of replication or autonomously replicating
sequence (ARS). The chromosomal transfer DNA is therefore incapable of replicating
independently when introduced into the host cell. The chromosomal transfer DNA may
optionally carry a promoter operably linked to the gene of interest.
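
The components listed in this definition can be summarized as a small data structure. The following is a minimal sketch for illustration only: the class, field names, and example values are hypothetical conveniences and are not part of the disclosure.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ChromosomalTransferDNA:
    """Illustrative record of the components named in the definition above."""
    genes_of_interest: List[str]                  # one or more copies of the gene
    selectable_marker: Optional[str]              # e.g. an antibiotic resistance gene
    recombination_site: Optional[str]             # e.g. "attP" or a homologous segment
    amplification_means: Optional[str]            # e.g. duplicate DNA flanking the marker
    promoter: Optional[str] = None                # optional per the definition
    origin_of_replication: Optional[str] = None   # must be absent
    circular: bool = True

    def problems(self) -> List[str]:
        """List every way this record departs from the definition."""
        issues = []
        if not self.circular:
            issues.append("is not circular")
        if not self.genes_of_interest:
            issues.append("lacks a gene of interest")
        if not self.selectable_marker:
            issues.append("lacks a selectable marker")
        if not self.recombination_site:
            issues.append("lacks a recombination site")
        if not self.amplification_means:
            issues.append("lacks means for facilitating amplification")
        if self.origin_of_replication:
            issues.append("has an origin of replication")
        return issues

    def is_valid(self) -> bool:
        return not self.problems()


# Hypothetical example values.
example = ChromosomalTransferDNA(
    genes_of_interest=["gene of interest"],
    selectable_marker="chloramphenicol resistance",
    recombination_site="attP",
    amplification_means="duplicate DNA flanking the marker",
    promoter="T7",
)
```

Treating the origin of replication as a field that must be empty makes the defining constraint explicit: the same record with an origin would describe a plasmid rather than a chromosomal transfer DNA.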

When a chromosomal transfer DNA carrying a site-specific recombination
site is introduced into a host cell having a chromosome which contains a second, similar
recombination site (e.g., another attP or attB site), expression in the host cell of an
enzyme which is capable of catalyzing the site-specific recombination of the
recombination sites (e.g., integrase) results in the integration of the vector into the host
cell chromosome at the recombination site. This site-specific recombination process is
much more efficient than general recombination systems acting on homologous vector
and host chromosomal sequences and results in integrated sequences having greater
stability, particularly when integrase synthesis can be controlled. Integrase may also be
provided by a plasmid or other DNA molecule transiently or stably present in the host
cell at the time when the chromosomal transfer DNA is introduced.

It will be apparent to one skilled in the art that there are a variety of
methods other than the preferred method utilizing attP, attB, and INT which may be used
to integrate a chromosomal transfer DNA into the chromosome of a host cell. For
example, non-replicating ColE1 replicons, transposable elements, or even naked DNA
carrying sequences homologous to sequences found on the host chromosome may be
used to insert the chromosomal transfer DNA into the host chromosome. The multicopy
colicin plasmids ColE1, CloDF13, ColK, and ColA all comprise site-specific
recombination systems including a cis- and trans-acting element. For use in the present
invention, the cis-acting element from one of these plasmids may be included on the
chromosomal transfer DNA and the trans-acting element may be on the chromosomal
transfer DNA or provided by the host cell. Transposons, such as the insertion sequence
(IS) and Tn3 families of transposons, may be used to integrate DNA into the chromosome
of a host cell. As with the colicin plasmids described above, the cis-acting transposon
elements are included on the chromosomal transfer DNA, while the trans-acting factor
may be included on the chromosomal transfer DNA or provided by the host cell. The
chromosomal transfer DNA may also carry a DNA sequence homologous to a sequence
found on the host cell chromosome, facilitating integration of the chromosomal transfer
DNA by homologous recombination. All of these methods fall within the scope of the
invention.

An important feature of this approach is that the gene encoding the
heterologous protein of interest is at no time operably linked to a functional promoter on
a multicopy vector during construction of the transfer DNA. By keeping a functional
promoter separated from the gene of interest until immediately before the foreign gene is
introduced into the cell at low copy number, the potential toxic or lethal effects of the
gene product can be minimized. A toxic foreign gene will not be expressed from a
multicopy number plasmid if the gene is not operably linked to a promoter. Other
methods for integrating a gene of interest into the host cell chromosome utilize
multicopy number plasmids carrying a gene of interest operably linked to a promoter
(e.g., Diederich et al. and Weinberg et al.); these genes will be expressed during the
propagation of the plasmid, making it extremely difficult, if not impossible, to produce
sufficient quantities of the plasmid if the gene of interest is toxic to the host cells in
which the plasmid is propagated.

The operable linkage between the gene encoding the heterologous protein
of interest and the promoter may be created as a result of the formation of the
chromosomal transfer DNA or as a result of integration into the host cell chromosome.
In the case where the operable linkage is formed as a result of the formation of the
chromosomal transfer DNA, the linkage is created by circularization of the chromosomal
transfer DNA. Circularization may be accomplished by, for example, ligation of one or
more DNA fragments to form a circular DNA or by homologous recombination into a
circular DNA, which would result in circularization of the insert. Preferably,
circularization is accomplished by ligation of one or more DNA fragments.
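
How circularization can create a linkage that neither source fragment carries may be illustrated with a toy model. The sketch below treats a DNA as an ordered list of named elements and calls a promoter operably linked to a gene when it sits immediately upstream of it; this adjacency rule, the element names, and the neglect of strand orientation are all simplifying assumptions made here for illustration.

```python
from typing import List


def operably_linked(elements: List[str], promoter: str, gene: str, circular: bool) -> bool:
    """True if `promoter` sits immediately upstream of `gene` in the element order."""
    n = len(elements)
    for i, element in enumerate(elements):
        if element != promoter:
            continue
        j = i + 1
        if j == n:
            if not circular:
                continue  # a linear fragment has nothing after its last element
            j = 0         # in a circle the last element is joined to the first
        if elements[j] == gene:
            return True
    return False


# Hypothetical fragments: the gene and the promoter come from separate sources.
fragment_with_gene = ["fusion gene", "selectable marker"]
fragment_with_promoter = ["recombination site", "promoter"]

# Joining the fragments end to end, then closing the circle.
ligated = fragment_with_gene + fragment_with_promoter
```

In this model the promoter and the gene are adjacent only once the ends of the ligated product are joined, which mirrors the point that the gene need never be operably linked to a promoter on a replicating plasmid.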

Alternatively, high level expression of less toxic gene products can be
accomplished by multiple integrations or by selection for amplification of integrated
genes.

Recombinant DNA Methods and Reagents 

General techniques for nucleic acid manipulation useful for the practice of
the claimed invention are described generally, for example, in Sambrook et al.,
MOLECULAR CLONING: A LABORATORY MANUAL, Vols. 1-3 (Cold Spring Harbor
Laboratory Press, 2nd ed., 1989); or F. Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR
BIOLOGY (Greene Publishing and Wiley-Interscience: New York, 1987) and periodic
updates. Reagents useful in nucleic acid manipulation, such as restriction enzymes, T7
RNA polymerase, DNA ligases and so on are commercially available from such vendors
as New England BioLabs, Boehringer Mannheim, Amersham, Promega Biotec, U.S.
Biochemicals, and New England Nuclear.

Dgfinitions 

"Foreign" or "heterologous*^ or "non-bacteriah*^ *Wive" or 
20 *'homolQgQus*^ A 'foreign or '"heterologous'' polypeptide is a polypeptide which is not 
normally found in a host cell of a particular species. The nucleic acid encoding such a 
polypeptide is also referred to as "foreign'' or ''heterologous." For example, insulin-like 
growth factor (IGF), insulin-like growth factor binding protiein (IGFBP), and 
transforming growth factor-beta (TGF-p) are native to mammalian cells and human 
25 rhinovirus 3C protease is native to viruses and virally-infected mammalian cells, but 
these proteins are foreign or heterologous to R coH . A "non-bacterial protein" is a 
protein or polypeptide which is not naturally found in a bacterial cell. Non-bacterial 
proteins include viral and eukaryotic proteins. Non-bacterial, foreign, or heterologous 
proteins may also be fusions between non-bacterial, foreign, or heterologous proteins and 
30 other proteins or polypeptides. For the embodiments encompassed by this invention, 
both "heterologous protein" and "non-bacterial protein" may be expressed. As disclosed 
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herein, genes encoding heterologous or non-bacterial proteins of interest do not contain 
promoters functional in the host cell. The genes must be linked to a separate promoter 
that is functional in the host cell in order to be expressed. A "native" or "homologous" 
polypeptide or DNA sequence, by contrast, is commonly found in the host cell. A 
5 promoter or other sequence effecting, for example, the transcription or translation of a 
gene is also considered ''homologous'' if it is functional in the host cell For example, a 
T7 promoter is considered "homologous" to an E. coli host cell, since, if T7 RNA 
polymerase is present in the cell, the T7 promoter is capable of driving the transcription 
of a polypeptide-encoding sequence to which it is operably linked. 

"Genes encoding heterologous, foreign or non-bacterial proteins" "Genes
encoding heterologous, foreign or non-bacterial proteins" contain all of the genetic
elements necessary for the expression of the heterologous, foreign or non-bacterial
protein with the exception of a promoter functional in the host cell. These genes
encompass recombinant genes which may include genetic elements native to the host
cell. Further, the coding regions of these genes may optionally be optimized for the
codon usage of the host cell.

"Encode" A nucleic acid is said to "encode" a polypeptide if, in its native
state or when manipulated by recombinant DNA methods, it can be transcribed and/or
translated to produce the polypeptide.

"Operably linked" A nucleic acid sequence is operably linked when it is
placed into a functional relationship with another nucleic acid sequence. For example, a
promoter is operably linked to a coding sequence if the promoter affects its transcription
or expression. Generally, DNA sequences which are operably linked are contiguous and,
where necessary, in reading frame.

"Recombinant" A "recombinant" nucleic acid is one which is made by
the joining of two otherwise separated segments of nucleic acid sequence in vitro or by
chemical synthesis.

"Chromosomal amplification" "Chromosomal amplification" refers to the
increase in copy number of a DNA sequence on the host chromosome. Chromosomal
amplification does not refer to extrachromosomal amplification such as replication of
multicopy number plasmids or in vitro amplification such as the polymerase chain
reaction (PCR).

Probes and primers 

Nucleic acid probes and primers are isolated nucleic acids, generally
single stranded, and, especially in the case of probes, are typically attached to a label or
reporter molecule. Probes are used, for example, to identify the presence of a
hybridizing nucleic acid sequence in a tissue or other sample or a cDNA or genomic
clone in a library. Primers are used, for example, for amplification of nucleic acid
sequences, e.g., by the polymerase chain reaction (PCR). The preparation and use of
probes and primers is described, e.g., in Sambrook et al., supra or Ausubel et al., supra.

Chemical synthesis of nucleic acids 

Nucleic acids, especially short nucleic acids such as amplification
primers, may be produced by chemical synthesis, e.g., by the phosphoramidite method
described by Beaucage and Caruthers (1981) Tetra. Letts. 22:1859-1862 or the triester
method according to Matteucci et al. (1981) J. Amer. Chem. Soc. 103:3185, and may be
performed on automated oligonucleotide synthesizers. A double-stranded fragment may
be obtained from the single-stranded product of chemical synthesis either by synthesizing
the complementary strand and annealing the strands together under appropriate
conditions or by adding the complementary strand using DNA polymerase with an
appropriate primer sequence.

Features of chromosomal transfer DNA and of plasmids used in their construction

Chromosomal transfer DNA comprises a DNA fragment encoding a
selectable marker and a sequence encoding a desired heterologous polypeptide.
Optionally, a chromosomal transfer DNA may also comprise, in an operable linkage to
the sequence encoding the desired heterologous polypeptide, transcription and translation
initiation regulatory sequences and expression control sequences, which may include a
promoter, an enhancer and necessary processing information sites, such as
ribosome-binding sites, and mRNA stabilizing sequences, as well as any necessary
secretion signals, where appropriate, which allow the protein to cross and/or lodge in cell
membranes, and thus attain its functional topology, or be secreted from the cell.

Plasmids used in construction of a chromosomal transfer DNA will also 
typically comprise a replication system recognized by the host, including an origin of 
replication or autonomously replicating sequence (ARS). In the case where a plasmid 
used in the construction of a chromosomal transfer DNA carries duplicate DNA 
sequences, the plasmid may be propagated in a rec- host cell. Preferably, rec- host cells
are used for propagation of plasmids used to create chromosomal transfer DNAs and 
plasmids carrying components of chromosomal transfer DNAs when these plasmids 
carry duplicate DNA sequences, and are not generally utilized as host cells for 
integration of chromosomal transfer DNAs. 

Chromosomal transfer DNA may be prepared from such vectors by means
of standard recombinant techniques well known in the art and discussed, for example, in
Sambrook et al., supra or Ausubel et al., supra.

An appropriate promoter and other sequences necessary for efficient 
transcription and/or translation are selected so as to be functional in the host cell. 
Examples of workable combinations of cell lines and expression vectors are described in 
Sambrook et al., supra or Ausubel et al., supra; see also, e.g., Metzger et al. (1988)
Nature 334:31-36. Promoters such as the trp, lac and phage promoters (e.g., T7, T3,
SP6), tRNA promoters and glycolytic enzyme promoters are useful in prokaryotic hosts.
Useful yeast promoters include the promoter regions for metallothionein,
3-phosphoglycerate kinase or other glycolytic enzymes such as enolase or
glyceraldehyde-3-phosphate dehydrogenase, enzymes responsible for maltose and
galactose utilization, and others. See, e.g., Hitzeman et al. EP 73,657A. Appropriate
mammalian promoters include the early and late promoters from SV40 (Fiers et al.
(1978) Nature 273:113) or promoters derived from murine Moloney leukemia virus,
mouse mammary tumor virus, avian sarcoma viruses, adenovirus II, bovine papilloma
virus or polyoma virus. In addition, the construct may be joined to an amplifiable gene
(e.g., DHFR) so that multiple copies of the gene may be made, where desired. For
appropriate eukaryotic enhancer and other expression control sequences see, e.g.,
ENHANCERS AND EUKARYOTIC GENE EXPRESSION (Cold Spring Harbor Press, New York,
1983).

It is preferable that the promoter driving expression of the heterologous
gene when integrated in the chromosome of the host is controllable.

Chromosomal transfer DNAs and plasmids employed in their construction
generally comprise a selectable marker, a gene encoding a protein necessary for the
survival or growth of a host cell transformed with the chromosomal transfer DNA or
plasmid. Typical selectable markers (a) confer resistance to antibiotics or other toxic
substances, e.g., ampicillin, neomycin, methotrexate, etc.; (b) complement auxotrophic
deficiencies; or (c) supply critical nutrients not available from complex media, e.g., the
gene encoding D-alanine racemase for Bacilli. The choice of the proper selectable
marker will depend on the host cell.

The chromosomal transfer DNAs of the present invention may contain a
site-specific recombination site, such as the phage lambda attP site. When transformed
into a bacterial host strain (such as E. coli B1384) which makes the enzyme integrase,
integrase recognizes the attP site on the chromosomal transfer DNA and catalyzes its
recombination with an att site (integrase can catalyze recombination between attP
and attB sites or between two attP sites). Bacterial host cells bearing the integrated DNA
are selected for on the basis of a selectable marker carried on the integrated DNA.

Thus, integration utilizing site-specific recombination generally involves
expression of an enzyme such as integrase which can catalyze site-specific recombination
and the presence of a site recognized by the enzyme on both the chromosomal transfer
DNA and the bacterial chromosome. Other site-specific recombination systems
characterized by an "integrase" or similar enzyme and sites specifically recognized by
the "integrase" could be used as well.

High level expression of a foreign gene integrated into the chromosome of
a host cell in multiple copies is also possible, e.g., by incorporating multiple sites in
the host cell chromosome and introducing multiple chromosomal transfer DNAs into the
host cell. Additionally or alternatively, host cells containing multiple copies of the
integrated DNA may be obtained by selecting for chromosomal amplification.

Chromosomal amplification is facilitated when the selectable marker is flanked by
duplicate DNA sequences. Preferably, the duplicate DNA sequences flank a first and a
second selectable marker. The first selectable marker is effective at low copy number
and can be used to select for integration of the chromosomal transfer DNA. The second
selectable marker is preferably effective only at high copy number. Following selection
for integration using the first selectable marker, the second selectable marker is then used
to select for host cells which contain multiple copies of the integrated DNA.
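
The arrangement described above, a selectable marker flanked by duplicate DNA sequences, can be checked on a candidate construct with a short script. The following is only a minimal sketch: the function name `flanking_direct_repeats`, the toy sequences, and the marker coordinates are hypothetical, and the default minimum repeat length of 25 bp is taken from the figure this document gives for duplicate sequences in its discussion of amplification of integrated DNA.

```python
def flanking_direct_repeats(seq, marker_start, marker_end, min_len=25):
    """Find exact direct repeats of length min_len that flank a marker.

    Returns a list of (upstream_pos, downstream_pos) pairs, where each pair
    gives the start positions (0-based, in `seq`) of an identical min_len-mer
    lying entirely upstream and entirely downstream of the marker region
    seq[marker_start:marker_end].
    """
    upstream = seq[:marker_start]
    downstream = seq[marker_end:]

    # Index every min_len-mer found upstream of the marker.
    index = {}
    for i in range(len(upstream) - min_len + 1):
        index.setdefault(upstream[i:i + min_len], []).append(i)

    # Look up every downstream min_len-mer in that index.
    hits = []
    for j in range(len(downstream) - min_len + 1):
        for i in index.get(downstream[j:j + min_len], []):
            hits.append((i, marker_end + j))
    return hits


# Hypothetical toy construct: a 25-nt repeat on each side of a "marker".
repeat = "ATGCGTACCGTTAGCATCGGATCCA"
marker = "CATCATCATCAT"
construct = "GG" + repeat + "CC" + marker + "AA" + repeat + "TT"
marker_start = len("GG" + repeat + "CC")
marker_end = marker_start + len(marker)

print(flanking_direct_repeats(construct, marker_start, marker_end))
```

Only exact, ungapped repeats are detected here; imperfect duplications would need an alignment-based approach.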

An important feature of the chromosomal transfer system of the present
invention is that the gene encoding the heterologous protein is not expressed before
integration; it is not operably linked to a promoter until either (a) the transfer DNA is
constructed in vitro or (b) the chromosomal transfer DNA is integrated into the host cell
chromosome. This approach allows one to employ high copy number plasmids as DNA
sources in constructing the chromosomal transfer DNA. High copy number plasmids
carrying a toxic heterologous gene are often difficult to propagate when the toxic gene is
operably linked to a promoter. Low copy number plasmids are more difficult to work
with in the laboratory. For example, DNA minipreps may produce inadequate DNA for
in vitro manipulations. The chromosomal transfer DNA is constructed from one or more
DNA sources by circularization of selected DNA fragments.

When a single DNA is used to construct the chromosomal transfer DNA,
both the gene encoding the heterologous protein of interest and the promoter are located
on the same DNA; however, the gene and promoter are not operably linked. This may be
accomplished by, for example, placing the promoter and gene of interest on either side of
a spacer DNA sequence which blocks any operable linkage (for example, by including a
terminator sequence). Preferably, this intervening DNA sequence also includes any other
portions of the source DNA which must be removed for creation of the chromosomal
transfer DNA, such as an origin of replication or ARS. The chromosomal transfer DNA
is constructed by deleting the DNA sequence which blocks the operable linkage between
the gene and the promoter, then circularizing the remaining DNA.

As shown in Figures 7, 8, and 9, chromosomal transfer DNAs may
optionally include a DNA sequence for the expression of E. coli cyclophilin, as described
in U.S. Patent No. 5,459,051.

There are several methods by which one may construct a chromosomal
transfer DNA using two or more DNA sources. One preferred embodiment, shown in
Figure 7, also uses two DNA sources. In this embodiment, each of the two DNA sources
carries a copy of the gene encoding the heterologous protein of interest and the promoter,
but the gene encoding the heterologous protein of interest and promoter are not operably
linked on either DNA source. As with the previously described embodiment, other
necessary sequences may be carried by either DNA source (alternatively the other
necessary sequences may be provided by one or more accessory DNA sources). The two
DNA sources are cleaved, then joined to each other, forming a circular chromosomal
transfer DNA which has two copies of the foreign gene, each operably linked to a copy
of a promoter. The promoter from the first DNA source is operably linked to the gene
encoding the heterologous protein of interest from the second DNA source, and the
promoter from the second DNA source is operably linked to the gene encoding the
heterologous protein of interest from the first DNA source.
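
The logic of this two-source embodiment can be illustrated with a toy model. This is a sketch under strong simplifying assumptions, not a description of the actual constructs: each cleaved fragment is represented as an ordered list of hypothetical element names, and a promoter is treated as "operably linked" only when it is immediately followed by a gene in the same orientation. The function `operable_links` and all element names are invented for illustration.

```python
def operable_links(elements, circular=True):
    """Return (promoter, gene) pairs in which a promoter directly precedes a gene.

    `elements` is an ordered list of element names read 5' to 3'. When
    `circular` is True the last element is treated as adjacent to the first,
    as in a circularized chromosomal transfer DNA.
    """
    n = len(elements)
    last = n if circular else n - 1
    links = []
    for i in range(last):
        current = elements[i]
        following = elements[(i + 1) % n]
        if current.startswith("promoter") and following.startswith("gene"):
            links.append((current, following))
    return links


# Hypothetical cleaved fragments from the two DNA sources. On each fragment
# the gene lies upstream of the promoter, so the two are not linked.
fragment_1 = ["gene_1", "terminator_1", "marker_1", "promoter_1"]
fragment_2 = ["gene_2", "terminator_2", "marker_2", "promoter_2"]

# Joining the fragments end to end and closing the circle.
transfer_dna = fragment_1 + fragment_2

print(operable_links(fragment_1, circular=False))  # no links on a source fragment
print(operable_links(transfer_dna))                # cross-wise promoter/gene links
```

In this model each promoter ends up driving the gene contributed by the other source, which mirrors the cross-wise linkage described above.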

Chromosomal transfer DNAs may also be designed without promoters
(Figures 8 and 9). These promoter-less chromosomal transfer DNAs are integrated into
target sites on the bacterial chromosome which place the gene encoding the heterologous
protein of interest into an operable linkage with a promoter on the host cell chromosome.
The chromosomal transfer DNA of this embodiment includes a copy of a gene encoding
a heterologous protein of interest linked in-frame to a segment of target-site DNA
homologous to DNA on the host cell chromosome and a selectable marker.
This target site DNA sequence will typically be the 5' end of a gene located on the
bacterial chromosome downstream from a promoter. Integration of the chromosomal
transfer DNA into the host cell chromosome will place the gene encoding the
heterologous protein of interest into operable linkage with a bacterial promoter. The
target sequence on the host cell chromosome may be a naturally occurring sequence or
may be a site which is introduced into the chromosome of the host cell. A target may be
introduced into the chromosome of a host cell utilizing a DNA sequence homologous to
a segment of the host cell chromosome, as described above for integration of the
chromosomal transfer DNA. A target site may also be introduced using site-specific
recombination, such as the attB/attP/INT system described above. A target site sequence
is at least about 10 bases long, preferably at least about 30 bases long, and most
preferably at least about 100 bases long. The DNA sequence on the chromosomal
transfer DNA and the target site are at least about 80% homologous, preferably at least
about 90% homologous and most preferably at least about 95% homologous. A target
site is preferably rare in the host cell chromosome and, more preferably, is unique in the
host cell chromosome. Integration of the chromosomal transfer DNA using a sequence
homologous to a segment on the host cell chromosome facilitates amplification of the
integrated DNA by placing duplicate DNA sequences flanking the integrated DNA (see
Figures 8 and 9).

WO 96/40722 PCT/US96/09006
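
The length and homology thresholds stated for a target site can be applied to a candidate sequence pair as follows. This is a hedged sketch: "homologous" is interpreted here as ungapped percent identity between two segments of equal length, which is an assumption rather than a definition given in the text, and the function names, tier labels, and example sequences are hypothetical.

```python
# Thresholds taken from the text: length in bases, homology in percent.
LENGTH_TIERS = [(10, "acceptable"), (30, "preferred"), (100, "most preferred")]
IDENTITY_TIERS = [(80.0, "acceptable"), (90.0, "preferred"), (95.0, "most preferred")]


def percent_identity(a, b):
    """Ungapped percent identity between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def tier(value, tiers):
    """Return the label of the highest threshold that `value` meets."""
    label = "below minimum"
    for cutoff, name in tiers:
        if value >= cutoff:
            label = name
    return label


def assess_target_site(transfer_segment, chromosomal_segment):
    """Classify a target-site segment by length and by identity."""
    identity = percent_identity(transfer_segment, chromosomal_segment)
    return {
        "length": len(transfer_segment),
        "length_tier": tier(len(transfer_segment), LENGTH_TIERS),
        "identity": identity,
        "identity_tier": tier(identity, IDENTITY_TIERS),
    }


# Hypothetical 20-base segments differing at two positions.
on_transfer_dna = "ACGT" * 5
on_chromosome = "TCGT" + "ACGT" * 3 + "ACGA"
print(assess_target_site(on_transfer_dna, on_chromosome))
```

A real comparison would normally use a gapped alignment; the ungapped comparison is used only to keep the example self-contained.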

Introducing DNA into host cells

A variety of methods for introducing nucleic acids into host cells are
known in the art, including, but not limited to, electroporation; transfection employing
calcium chloride, rubidium chloride, calcium phosphate, DEAE-dextran, or other
substances; microprojectile bombardment; lipofection; and infection (where the vector is
an infectious agent, such as a retroviral genome). See generally, Sambrook et al., supra
and Ausubel et al., supra.

Host cells 

The methods of the present invention are preferably used with prokaryotic
host cells, although they would be applicable to eukaryotic host cells as well. Among
prokaryotic hosts, gram negative bacteria are preferred, especially Escherichia coli.
Other prokaryotes, such as Bacillus subtilis or Pseudomonas, may also be used.

Mammalian or other eukaryotic host cells, such as yeast, filamentous
fungi, plant, insect, amphibian or avian species may also be used. See TISSUE CULTURE
(Kruse and Patterson, eds., Academic Press, 1973). Useful mammalian host cell lines
include, but are not limited to, VERO and HeLa cells, Chinese hamster ovary (CHO)
cells, and WI38, BHK, and COS cell lines.

Amplification of Integrated DNA

Amplification of integrated genes can be efficiently accomplished by any
of several methods, for example, chromosomal duplication or replicative transposition.
Integrated DNA which contains or is flanked by duplicate DNA sequences of 25 or more
base pairs will form chromosomal duplications (Normark et al. (1977) J. Bacteriol.
132:912-922; Edlund et al. (1979) Mol. Gen. Genet. 173:115-125; Tlsty et al. (1984)
Cell 37:217-224; Stern et al. (1984) Cell 37:1015-1026). Selection for duplications
(amplification) is greatly facilitated if the duplicate DNA contains a selectable marker,
such as an antibiotic resistance gene or a gene which complements a host cell deficiency.
Preferably the integrated DNA includes two selectable markers: a first selectable marker
which is operable at low copy number and is used to select for integrants, and a second
selectable marker which requires high copy number and is used to select for host cells
which have amplified the integrated DNA. Amplification may also be accomplished by
replicative transposition, in the case where the chromosomal transfer DNA contains the
appropriate transposon sequences or the chromosomal transfer DNA is integrated into a
transposon. Preferably, amplification is accomplished by selection for chromosomal
duplications.

Production of non-bacterial proteins

Following integration of the chromosomal transfer DNA into the host cell 
chromosome, and optionally following amplification of the integrated DNA, the foreign 
gene may be expressed, resulting in the production of the non-bacterial protein of 
interest. It is preferable that the promoter controlling expression of the integrated gene 
be controllable (i.e., inducible), so that any toxic effects of the gene product can be 
minimized. Following expression of the foreign gene, the protein product may be 
purified. As will be apparent to one skilled in the art, the purification method used will 
depend on the identity of the foreign protein. 

The invention will be better understood by reference to the following 
examples, which are intended to merely illustrate the invention. The scope of the 
invention is not to be considered limited thereto. 

EXAMPLES

Example 1

Integration of a chromosomal transfer DNA comprising a foreign gene into the
chromosome of E. coli strain B1384

The general strategy for integrating a chromosomal transfer DNA
comprising a foreign gene into the chromosome of E. coli is depicted schematically in
Figure 1. Two plasmids were constructed: pDM25432 contained a foreign gene of
interest (in this example, an IGF-I fusion gene) lacking an operably linked bacterial
promoter; pDM25423 contained a T7 promoter. By ligating restriction fragments
purified from each of these vectors, a DNA circle lacking an origin of replication (a
chromosomal transfer DNA) was generated. This chromosomal transfer DNA
contained an antibiotic resistance gene which affords resistance to chloramphenicol
(CAM-r) and a site-specific recombination site from phage lambda, attP. This
chromosomal transfer DNA is transformed into a bacterial strain such as E. coli B1384
(Mascarenhas et al. (1983) Virology 124:100-108) (Figure 2), which makes the enzyme
integrase (INT) under the control of the trp promoter, which can be induced during
transformation by adding 1 mM indole acrylic acid (IAA) to the medium. B1384 also
contains an attP site in its chromosome. Integrase recognizes the attP sites on the
chromosomal transfer DNA and in the chromosome of B1384 and catalyzes their
recombination, leading to the site-specific integration of the chromosomal transfer DNA
into the bacterial chromosome at the attP site (Weisberg et al., COMPREHENSIVE VIROLOGY,
vol. 8, pp. 197-258 (Plenum, Fraenkel-Conrat and Wagner, eds., New York, NY, 1977)).
Bacterial host cells bearing the integrated DNA are selected for on the basis of their
resistance to chloramphenicol.

Chloramphenicol-resistant chromosomal integrants were tested as
summarized in Figure 3. The presence of the integrated chromosomal transfer DNA was
confirmed by amplifying host chromosomal DNA by PCR with the following primer sets
(e.g., UBUF x IGFR, 1243 x T7REV, or TRPPF x 1239):

IGFR: 5'...CCC ATC GAT GCA TTA AGC GGA TTT AGC
CGGTTTCAG...3'

#1239: 5'...GCC TGA CTG CGT TAG CAA TTT AAC TGT
GAT...3'

#1243: 5'...CTGGGCTGCTTCCTAATGCAGGAGTCG
CAT...3'

#1227: 5'...TAA TAC GAC TCA CTA TAG GGA GA...3'

TRPPF: 5'...GAT CTG TTG ACA ATT AAT GAT GGA ACT
AGT TAA CTA GTA CGC AAG TT...3'

T7REV: 5'...TGC TAG TTA TTG CTC AGC GG...3'

CYCF1: 5'...CAG GAT CCG ATC GTG GAG GAT GAT TAA
ATG GGG AAA GGG GAC CCG CAC...3'

CYCR1: 5'...CAGGAAGGTTACGGC AGG AGTTTAGGG
GAAAG...3'

UBUF: 5'...GGGGCGGGGGTGGGATGCAGATTTTCG
TCAAGACTTTGA...3'

The amplified fragments were digested with the appropriate restriction
enzyme (SacII, HincII, or BamHI, respectively). The products were sized by agarose gel
electrophoresis. Presence of the integrated sequences was demonstrated by amplification
of:

• chromosomal ubiquitin and IGF sequences, demonstrating
the presence of the relevant foreign gene;

• chromosomal tet and T7 sequences, demonstrating the
juxtaposition of the T7 promoter and the fusion gene; and

• adjacent chromosomal trp and tet sequences, demonstrating
insertion of the chromosomal transfer DNA at the expected location.
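
The expected fragment sizes from such a diagnostic digest can be predicted in silico before running the gel. The sketch below is illustrative only: the amplicon sequences are hypothetical placeholders rather than the actual PCR products, and the function name `digest_sizes` is invented. The recognition sites and top-strand cut positions used (SacII CCGC^GG, HincII GTY^RAC, BamHI G^GATCC) are the standard ones for these enzymes; all three sites are palindromic, so scanning one strand is sufficient.

```python
import re

# Recognition pattern (as a regex) and cut offset within the site, top strand.
ENZYMES = {
    "SacII": ("CCGCGG", 4),          # CCGC^GG
    "HincII": ("GT[CT][AG]AC", 3),   # GTY^RAC
    "BamHI": ("GGATCC", 1),          # G^GATCC
}


def digest_sizes(amplicon, enzyme):
    """Return top-strand fragment lengths after complete digestion of a linear amplicon."""
    pattern, offset = ENZYMES[enzyme]
    seq = amplicon.upper()
    # A lookahead is used so that overlapping sites would also be found.
    cuts = [m.start() + offset for m in re.finditer("(?=" + pattern + ")", seq)]
    edges = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(edges, edges[1:])]


# Hypothetical amplicons, each carrying a single site.
bam_amplicon = "A" * 10 + "GGATCC" + "T" * 20
hinc_amplicon = "AAAA" + "GTTAAC" + "CCCC"

print(digest_sizes(bam_amplicon, "BamHI"))
print(digest_sizes(hinc_amplicon, "HincII"))
print(digest_sizes(bam_amplicon, "SacII"))  # no site: one uncut fragment
```

The predicted sizes can then be compared with the bands observed after agarose gel electrophoresis.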

The chromosomal integration of the chromosomal transfer DNA was also confirmed by
the following evidence:

• resistance of the bacterial host to chloramphenicol;

• no plasmid DNA in DNA minipreps;

• lack of beta-lactamase enzymatic activity, confirming the
absence of the parental plasmids (beta-lactamase was assayed using a
chromogenic substrate,
7-(thienyl-2-acetamido)-3-[2-(4-N,N-dimethylaminophenylazo)pyridinium-methyl]-3-cephem-4-carboxylic acid
(PADAC), as described in ENZYME INHIBITORS pp. 169-177 (Verlag
Chemie, Broderick, V., ed.)); and

• segregation analysis: isolates were grown in L broth with
or without 1 mM IAA at 37°C overnight and plated on LB agar plates.
Single colonies from each culture were tested for retention of
chloramphenicol resistance. 100% retention was observed in cultures
without IAA; 11% retention was observed in cultures with IAA.

Six of seven isolates tested showed the expected phenotypes.

B1384 does not contain the gene for T7 RNA polymerase. In order to test
the expression of the chromosomal constructs, P1 lysates were prepared on each of the
six strains carrying the integrated DNA and used to transduce strain W3110DE3 to
chloramphenicol resistance (A SHORT COURSE IN BACTERIAL GENETICS: A LABORATORY
MANUAL AND HANDBOOK FOR ESCHERICHIA COLI AND RELATED BACTERIA (Cold Spring
Harbor Laboratory Press, Miller, J.H., ed., 1992)). Strain W3110DE3 carries the T7
RNA polymerase gene under the control of the lac promoter. It is also Gal+, unlike
B1384. Transductants were therefore selected on galactose minimal plates containing 20
µg/ml chloramphenicol. Single colonies from each transduction experiment
(independent donors) were purified and tested further.

The results obtained were identical in all six independent
cases: the chromosomal transfer DNA was transferred with high
efficiency to a new location on the bacterial chromosome, the att sites
flanking the prophage in W3110DE3. This was confirmed by:

• chloramphenicol resistance;

• no plasmid DNA in DNA minipreps;

• phage immunity (DE3 lysogen; phage lysates were plated on
bacterial lawns by standard techniques);

• gal+ (i.e., growth on galactose minimal plates);

• expression of IGF protein under lac control (expression
and analysis carried out as described in Example 1 of co-owned,
co-pending U.S. patent application Serial No. 08/101,506, filed August 2,
1993).

Chromosomal DNA from the six strains ("integrants") was digested to
completion with BglII and NcoI and a Southern blot of the digested DNA was probed
with a labeled 0.6 kb DsbA DNA probe which covers the entire gene sequence coding for
mature DsbA (Bardwell et al. (1991) Cell 67:581-589; see also Kamitani et al. (1992)
EMBO J. 11:57-62). Each of the six integrants contained insertions; the blots
demonstrated the existence of several double insertions, one single insertion, and one
(isolate WB3-6) apparently duplicate double (i.e., triple) insertion.

The six integrants were tested for expression of the IGF fusion protein
after induction with isopropyl-β-thiogalactopyranoside (IPTG). Cells were induced with
IPTG for two hours and whole cell extracts for the induced integrants, as well as size
markers and an IGF fusion protein control, were separated by 12% SDS-PAGE, Western
blotted, and reacted with polyclonal anti-IGF sera (see Example 1 of co-owned,
co-pending U.S. patent application Serial No. 08/101,506, filed August 2, 1993) (Figure 4).
Isolate WB3-6 (Figure 4, lane 6) showed the highest levels of expression of the IGF
fusion protein. An induced band of the same size was also seen on Coomassie
blue-stained gels.

A different binary system was used to generate a chromosomal transfer
DNA carrying a kanamycin resistance marker. The plasmids used, pDM25424 and
pDM25427, are described in the figures. The configuration and location of the insert
were confirmed by PCR, giving results which were virtually identical to those described
above. After transduction into the W3110DE3 background, several individual isolates
were obtained which expressed the IGF fusion protein at levels that could be easily
detected by Western blotting (Figure 5). Procedures used were identical to the ones
described above for the chloramphenicol-resistant isolates, except that the antibiotic and
resistance gene were kanamycin instead of chloramphenicol. Purified fusion protein was
the control. Lanes 1 and 2 contain whole cell lysates from two transduced isolates.

The construction of the vectors employed in the two binary systems is
summarized in Figures 10-13. The sources for the plasmids employed were: pBR322,
pUC18, pUC19, pKK233.2, pTrc99A, pCH110, and pNEO (Pharmacia, Piscataway,
NJ); pLG339HLY (Dr. Barry Holland, Institut de Génétique et Microbiologie,
University Paris-Sud); pRcCMV (Invitrogen, San Diego, CA); pACYC177 and
pACYC184 (New England BioLabs, Beverly, MA); pET3b (Studier and Moffatt (1986) J.
Mol. Biol. 189:113-130); pYZ22070 (described in Example 1 of co-owned, co-pending
U.S. patent application Serial No. 08/101,744, filed August 2, 1993).

E. coli K-12 strain W3110 was obtained from B. Bachmann, ECGSC,
Yale University. It was lysogenized with the DE3 defective phage as described by
Studier and Moffatt (1986) J. Mol. Biol. 189:113-130. W3110DE3 was one such
lysogen. The cyclophilin gene was amplified by the polymerase chain reaction (PCR)
from W3110 using the primers CYCF1 and CYCR1 (see above).

Example 2

Chromosomal expression of a DsbA::ubiquitin::IGF-I fusion gene

A DsbA::ubiquitin::IGF-I fusion gene was assembled and integrated into
the chromosome of bacterial host cells with a chromosomal transfer DNA produced
using the double-cassette binary system. The strategy for constructing the double
cassette binary system vectors is shown in Figure 14. The general strategy for
constructing a chromosomal transfer DNA (CTD) with the double cassette system is
shown in Figure 7. The strategy used to create the chromosomal transfer DNA carrying
the DsbA::ubiquitin::IGF-I fusion gene is shown in Figure 12. Following chromosomal
integration, the fusion gene was expressed, resulting in extremely high levels of protein
accumulation.




The double cassette binary system utilizes two plasmids, pDM25470 and
pDM25465, as shown in Figures 7 and 14. pDM25425 is a pUC19 derivative carrying
copies of attP, the T7 promoter, and a copy of the rrnt1t2 terminator, from which a 1.6 kb
fragment was deleted by BglII/BamHI digestion. A terminator and a sequence encoding
DsbA (a 1.5 kb NcoI(fill)/NsiI fragment from pDM25463) was ligated to
EcoRI(fill)/NsiI-digested pDM25459 to form pDM25470 (one of the double cassette
binaries). The other double cassette plasmid, pDM25465, carries two copies of a
terminator, a kanamycin resistance gene, and the cyclophilin gene (the use of the
cyclophilin gene to aid in protein production is described in co-owned, co-pending U.S.
patent application Serial Number 08/101,506, incorporated herein by reference in its
entirety). The cyclophilin gene was cloned from pER15951 (HindIII(fill)/XbaI, 0.6 kb
fragment) into pDM25424 (BamHI(fill)/XbaI, 5.2 kb fragment; a pUC19 backbone
carrying two copies of a terminator and a kanamycin resistance gene). The kanamycin
resistance gene in pDM25430 (derived from pDM25424) was insufficiently effective, so
it was replaced with a kanamycin resistance gene from pLG339hly (PvuII/EcoRI digest),
creating plasmid pDM25443. The T7 promoter was cloned into pDM25443 by annealing
oligos T7F and T7R and ligating them to EcoRI-digested pDM25443, creating
pDM25465.

Two sets of oligonucleotides were synthesized (1, 2, 1R, 2R and 3, 4, 3R,
4R), phosphorylated, denatured, and annealed. The annealing product of 1, 2, 1R, and
2R, which encodes ubiquitin, was ligated into pUC18 (SphI-BamHI digest). The
annealing product of 3, 4, 3R, and 4R, which encodes IGF-I, was ligated into pUC18
(EcoRI-BamHI digest). The resulting plasmids were transformed into JM109 and the
transformed host cells were selected on ampicillin plates. Transformants were analyzed
for the presence of the ubiquitin and IGF-I sequences, then sequenced to identify
correctly formed constructs. One isolate from each was selected, and designated
pP039354 and pP039334, respectively.
1 

5'-CAG ATT TTC GTC AAG ACT TTG ACC GGT AAA ACC ATA
ACA TTG GAA GTT GAA CCT TCC GAT ACC ATC GAG AAC GTT

AAG GCG AAA ATT CAA GAC AAG GAA GGT ATC CCT CCA 
GATCA-3' 






2 

5'-ACA AAG ATT GAT CTT TGC CGG CAA GCA GCT AGA AGA
CGG TAG AAC GCT GTC TGA TTA CAA CAT TCA GAA GGA GTC 
CAC CTT ACA TCT TGT GCT AAG GCT CCG CG-3' 

1R

5'-ATA CCT TCC TTG TCT TGA ATT TTC GCC TTA ACG TTC TCG 
ATG GTA TCG GAA GGT TCA ACT TCC AAT GTT ATG GTT TTA 
CCG GTC AAA GTC TTG ACG AAA ATC TGC ATG-3' 

2R 

5'-GAT CCG CGG AGC CTT AGC ACA AGA TGT AAG GTG GAC 
TCC TTC TGA ATG TTG TAA TCA GAC AGC GTT CTA CCG TCT 
TCT AGC TGC TTG CCG GCA AAG ATC AAT CTT TGT TGA TCT 
GGAGGG-3' 

3 

5'-GAT CCC CGC GGT GGT GGT CCG GAA ACC CTG TGC GGT 
GCT GAA CTG GTT GAC GCT CTT CAG TTC GTT TGC GGT GAC 
CGT GGT TTC TAC TTC AAC AAA CCG ACC GGT TAC GGT TCC 
TCC TCC CGT CGT GCT CCG CAG-3' 

4 

5'-ACC GGT ATC GTT GAC GAA TGC TGC TTC CGG TCC TGC 
GAC CTG CGT CGT CTG GAA ATG TAC TGC GCT CCG CTG AAA 
CCG GCT AAA TCC GCT TAA TGC ATC GAT CTC GAG-3' 

3R 

5'-AGC ACG ACG GGA GGA GGA ACC GTA ACC GGT CGG TTT 
GTT GAA GTA GAA ACC ACG GTC ACC GCA AAC GAA CTG 
AAG AGC GTC AAC CAG TTC AGC ACC GCA CAG GGT TTC CGG 
ACC ACC ACC GCG GG-3'

4R 

5'-AAT TCT CGA GAT CGA TGC ATT AAG CGG ATT TAG CCG 

GTT TCA GCG GAG CGC AGT ACA TTT CCA GAC GAC GCA GGT 
CGC AGG ACC GGA AGC AGC ATT CGT CAA CGA TAC CGG TCT 
GCGG-3' 
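
Because each "R" oligonucleotide is intended to anneal to its forward partner, a quick complementarity check is a useful sanity test before synthesis. The sketch below is a hedged illustration: the helper names are hypothetical, and the two short oligonucleotides in the example are invented stand-ins rather than the sequences listed above, which should be verified against the original filing before any such check is relied upon.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(oligo):
    """Strip spaces, 5'/3' labels and other non-base characters."""
    return "".join(base for base in oligo.upper() if base in "ACGT")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def longest_duplex(top, bottom):
    """Length of the longest perfectly base-paired stretch between two oligos.

    Both oligos are written 5' to 3'. The bottom strand is reverse
    complemented, and the longest common substring with the top strand is
    found by dynamic programming.
    """
    a = clean(top)
    b = reverse_complement(bottom)
    best = 0
    previous = [0] * (len(b) + 1)
    for base_a in a:
        current = [0] * (len(b) + 1)
        for j, base_b in enumerate(b, start=1):
            if base_a == base_b:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


# Hypothetical pair: the last 8 bases of `top` pair with `bottom`,
# leaving single-stranded overhangs on both ends.
top = "5'-GAT CCA TGC AGA-3'"
bottom = "5'-AAT TTC TGC ATG-3'"
print(reverse_complement(bottom))
print(longest_duplex(top, bottom))
```

A short or absent duplex between oligonucleotides that are meant to pair would point to a design or transcription error in one of the sequences.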

The ubiquitin and IGF-I sequences were isolated from pP039354 and
pP039334 (by SphI-SacII and SacII-NsiI digests, respectively), and cloned into
SphI-NsiI digested pDM25454 (a pUC19-based plasmid carrying a sequence coding for
DsbA), to create a plasmid, designated pP039358, containing a DsbA::ubiquitin::IGF-I
fusion gene.

The fusion gene from pP039358 was ligated into the double-cassette
binary parent vectors pDM25470 and pDM25465 to create pP039377 and pP041623,
respectively. EcoRI-XbaI fragments of pP039377 and pP041623 were ligated to form
the chromosomal transfer DNA (Figure 17).

The chromosomal transfer DNA was transformed into E. coli strain
B1384, which contains an att site as well as a sequence, under the control of the trp
promoter, encoding the enzyme integrase (INT). Indole acrylic acid (1 mM) was added 
to induce the expression of INT and resulted in the integration of transduced 
chromosomal transfer DNAs. Cells were tested for chromosomal transfer DNA 
integration by: 

Blue/yellow screening: Cells were tested for integrated DNA by
blue/yellow screening with AmpScreen (BRL). Colonies with a blue
phenotype were further screened; yellow colonies were discarded.

PCR: Cells were tested for properly integrated DNA by amplification of
host cell chromosomal DNA using primer pairs:

T7F1 5'-AAT TGT CGA CAT TAA TAC GAC TCA CTA TAG GGA
GAC CAC AAC GGT TTC CCT GAA TTG TCG ACA TTA ATA CGA
CTC ACT ATA GGG AGA CCA CAA CGG TTT CCC TG-3'

IGFREV 5'-CCC ATC GAT GCA TTA AGC GGA TTT AGC CGG TTT
CAG-3'

which confirm the presence of the complete fusion gene with its promoter; and

T7REV 5'-TGC TAG TTA TTG CTC AGC GG-3'

TRPBR2 5'-AAG GGC TTC ATC ATC GGT AAT AGA CA-3'

which confirm the integration of the chromosomal transfer DNA into the att site of
B1384. 
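
The logic of the diagnostic PCR can be illustrated with a small in-silico check: a product is expected only when both primer sites are present on the same molecule in the correct relative orientation. The Python sketch below uses the T7REV and TRPBR2 primer sequences given above, but the template strings are invented purely for demonstration and do not represent the actual B1384 junction sequence; the function name `predicted_product` is likewise hypothetical. Only exact matches in one orientation are considered.

```python
# Illustrative sketch: the templates below are hypothetical stand-ins,
# not real chromosomal sequence. Only exact primer matches are considered.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def predicted_product(template: str, primer_a: str, primer_b: str):
    """Return the region of `template` bounded by primer_a (as written) and
    primer_b (annealing to the opposite strand), or None if a site is absent."""
    start = template.find(primer_a)
    if start < 0:
        return None
    site_b = reverse_complement(primer_b)
    end = template.find(site_b, start + len(primer_a))
    if end < 0:
        return None
    return template[start:end + len(site_b)]


T7REV = "TGCTAGTTATTGCTCAGCGG"
TRPBR2 = "AAGGGCTTCATCATCGGTAATAGACA"

# Hypothetical junction in an integrant: the transfer-DNA primer site lies
# next to the host primer site.
integrated = "ACGT" * 3 + T7REV + "GATTACA" * 4 + reverse_complement(TRPBR2) + "TTGCA"
# Hypothetical non-integrant: the host site is present, the transfer-DNA site is not.
unintegrated = "ACGT" * 3 + "GATTACA" * 4 + reverse_complement(TRPBR2) + "TTGCA"

product = predicted_product(integrated, T7REV, TRPBR2)
print(len(product) if product else "no product")
print(predicted_product(unintegrated, T7REV, TRPBR2))
```

In this toy case a 74-base product is predicted for the integrant and none for the non-integrant, which mirrors why a primer pair spanning the junction is diagnostic for integration at the att site.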

Production of protein from integrated genes requires T7 RNA polymerase
activity, which is lacking in B1384. To test protein production from the integrated gene,
P1 lysates were made using a B1384 integrant. The lysates were then transduced into E.
coli strain W3110DE3 (as described in Example 1), which is Gal- and carries a copy of
the T7 RNA polymerase gene under the control of the lac promoter. Transductants were
selected by plating on galactose minimal medium plates which contained 10 µg/ml
kanamycin. Single kan^r/Gal+ colonies were isolated and reselected on galactose minimal
medium plates with kanamycin. Kan^r/Gal+ colonies were further analyzed by PCR using
primer pairs:

ATT3 5'-GAG GTA CCA GCG CGG TTT GAT CAG-3'
T7RNAP1 5'-CAG CGT TAT CCG CAA CCT CAC C-3'

which showed that the upstream site flanking the prophage in W3110DE3 is
unoccupied; and

T7F1 5'-AAT TGT CGA CAT TAA TAC GAC TCA CTA TAG GGA
GAC CAC AAC GGT TTC CCT GAA TTG TCG ACA TTA ATA CGA
CTC ACT ATA GGG AGA CCA CAA CGG TTT CCC TG-3'

IGFREV 5'-CCC ATC GAT GCA TTA AGC GGA TTT AGC CGG TTT
CAG-3'

which confirmed that the fusion gene expression cassette was transferred intact.

Individual isolates from the W3110DE3 transduction were tested for T7
RNA polymerase activity by streaking the isolates against phage 4107, which requires T7
RNA polymerase activity to lyse bacteria (Novagen). An isolate which contained an
intact fusion gene expression cassette and which was positive for T7 RNA polymerase
activity, designated c49222, was used to test protein production. Protein expression was
induced by the addition of IPTG to a culture of c49222 for two hours. Protein
production was analyzed by SDS-PAGE of a whole cell lysate on a 12.5% acrylamide
gel. Densitometric analysis of an SDS-PAGE gel showed that the
DsbA::ubiquitin::IGF-I fusion protein accumulated to 22.3% of total cell protein.

P1 lysates were also used to transduce the integrated gene into E. coli
strain cDM46809 (which is cam^r, malE deleted, and contains an attB site introduced into
the lac region). Transductants were selected by growth on plates containing kanamycin
and chloramphenicol. Integration into the lac region was confirmed by PCR using
primer pair:

UBI1 5'-CAG ATT TTC GTC AAG ACT TTG ACC GGT AAA ACC
ATA ACA TTG GAA GTT GAA CCT TCC GAT ACC ATC GAG AAC
GTT AAG GCG AAA ATT CAA GAC AAG GAA GGT ATC CCT
CCA GAT CA-3'

1224 5'-CGC CAG GGT TTT CCC AGT CAC GAC-3'

A P1 lysate was then made from an isolate which was kan^r/cam^r and
integrated into the lac region. This P1 lysate was used to transduce W3110DE3. Transductants
were selected for kanamycin and chloramphenicol resistance by growth on selective media.
Kan^r/cam^r isolates were tested for T7 RNA polymerase activity by streaking against phage
4107 as described above. Two isolates positive for T7 RNA polymerase activity,
designated c49258#46 and c49258#50, were tested for protein accumulation by induction
with IPTG for two hours. Whole cell lysates were analyzed by SDS-PAGE using 12.5%
acrylamide gels. DsbA::ubiquitin::IGF-I fusion protein accumulated to 19.6% of total cell
protein in c49258#46, as measured by densitometry of an SDS-PAGE gel.

Southern blot analysis of chromosomal DNA from c49222 and
c49258#46 was performed to check the copy number of the integrated DNA.
Chromosomal DNA from c49222 and c49258#46 was isolated, digested with restriction
endonucleases, transferred to Hybond-N (Amersham), and probed with a DNA
fragment encoding the ubiquitin and IGF-I portions of the fusion protein. Analysis of the
Southern blot showed that there were approximately two copies each of the
DsbA::ubiquitin::IGF-I gene integrated into the chromosomes of c49222 and c49258#46
(Figure 24), i.e., a single copy of the integrated DNA. This result was surprising and
unexpected in view of the levels of accumulation of DsbA::ubiquitin::IGF-I protein
shown by SDS-PAGE (22.3% and 19.6% of total cell protein, respectively). Ordinarily,
it is expected that such high levels of protein accumulation can only be accomplished by
expression of heterologous genes carried by high copy number plasmids.

DsbA::ubiquitin::IGF-I was also produced by integrating a chromosomal
transfer DNA carrying a gene for tetracycline resistance in addition to the gene for
kanamycin resistance. P1 lysates prepared from a B1384 integrant were used to
transduce W3110DE3 to kanamycin resistance (see Example 1). Kan^r isolates were
checked for properly integrated DNA using primer pairs T7F1 x IGFREV and ATT3 x
T7RNAP1 as described above. Isolates were also tested for T7 RNA polymerase activity
by streaking against phage 4107 as described above. Isolates positive for T7 RNA
polymerase activity were then selected for amplification of the integrated DNA by
growth on medium containing kanamycin (10 µg/ml) and tetracycline (30 µg/ml). The
tetracycline allele incorporated into this construct is effective at high copy number;
therefore, colonies which are tetracycline resistant may have amplified the integrated
DNA. Kan^r/tet^r colonies were tested for protein accumulation by induction with IPTG,
as described above. All kan^r/tet^r colonies produced the fusion protein upon induction.

Example 3

Chromosomal expression of ubiquitin hydrolase (UBP-1)

The construction of plasmids used in this example is described in Figures
10-16. pJT70 was the source of the ubiquitin hydrolase. pDM25493 was the source of
the lot promoter used for this construct. Chromosomal transfer DNAs for the yeast
UBP-1 gene under the control of the fip promoter were prepared from pDM46813 and
either pDM25472 or pDM25448. In this example, pDM25472 was used (i.e.
chromosomal transfer DNA#1 of Figure 16). The fusion gene formed by this
chromosomal transfer DNA encodes an in-frame fusion between a truncated DsbA gene
and a UBP-1 cDNA missing the amino-terminal 92 codons.

The chromosomal transfer DNA was introduced into B1384 as in
Example 2. Integrants were selected with kanamycin (10 µg/ml). Isolated colonies
were tested in a diagnostic PCR reaction using primers TRPPF and 1239 (as described in
Example 1). All isolates were positive by this test. All isolates were also ampicillin
sensitive.

One colony was selected for further characterization. P1 lysates were
prepared from this isolate and used to transduce W3110DE3 to kanamycin resistance as
described in Example 1. Kanamycin resistant colonies were further tested by PCR using
primers ATT3 and T7RNAP1, as described in Example 2. All isolates showed the
expected location at the attB/P or attP/B sites flanking the DE3 lysogen.

The isolates were tested for protein expression by testing for ubiquitin 
hydrolase activity. Isolates were grown in casamino acid minimal medium, harvested 
and lysed by sonication. The soluble fraction was assayed for activity by incubation with
DsbA::ubiquitin::IGF-I fusion protein substrate at 37°C for one hour. Cleavage was
monitored by SDS-PAGE. All isolates (WBD311, 312, 313, 314, 331, and 332) showed
good levels of enzyme activity (i.e. complete cleavage of the substrate under assay
conditions).

Example 4

Expression of an insulin-like growth factor binding protein-3 (IGFBP-3) fusion protein
A chromosomal transfer DNA carrying a gene encoding a fusion protein comprising DsbA,
a linker including a human rhinovirus 2A protease site, and IGFBP-3
(DsbA::2A::IGFBP-3) was created using the double cassette method. Construction of
the fusion gene and chromosomal transfer DNA are shown in Figure 18. DsbA was from
pDM46905, the 2A protease site was created by annealing primers V2ATA and V2ATB,
and IGFBP-3 was PCR amplified from pYZ42580 using primers BP3RZ and NBP3F.

The IGFBP-3 gene used to create the DsbA::2A::IGFBP-3 fusion was
created by annealing and ligating a number of synthetic oligonucleotides, which, when
fully assembled, code for IGFBP-3 protein. The oligonucleotides were assembled in
three segments: 5', 3', and middle. Oligonucleotides

F1-1 5'-AGC TTG GTG CTT CTT CTG CTG GTC TTG GAC CAG
TTG TTC GTT GTG AAC CAT GTG ATG CAC GAG CTT TAG CTC
AAT GTG CTC CAC CAC CAG CTG TT-3',

F1-2 5'-TGT GCT GAA TTA GTT CGA GAA CCA GGT TGT GGT
TGT TGT TTA ACT TGT GCT TTA TCT GAA GGT CAA CCA TGT
GGT ATT TAT ACT GAA CGT TGC GG-3',

F1-3 5'-TAG TGG TTT GCG TTG TCA ACC AAG CCC AGA TGA
AGC TAG GCC TTT ACA AGC ATT ATT AGA TGG TCG AGG TCT
GTG TGT TAA TGC GTC CGC TGT TTC TCG ATT GCG CGC G-3',

C1-1 5'-TCG ACG CGC GCA ATC GAG AAA CAG CGG ACG CAT
TAA CAC ACA GAC CTC GAC CAT CTA ATA ATG CTT GTA AAG
GCC TAG CTT CAT CTG GGC TTG GTT G-3',

C1-2 5'-ACA ACG CAA ACC ACT ACC GCA ACG TTC AGT ATA
AAT ACC ACA TGG TTG ACC TTC AGA TAA AGC ACA AGT TAA
ACA ACA ACC ACA ACC TGG TTC TC-3',

and

C1-3 5'-GAA CTA ATT CAG CAC AAA CAG CTG GTG GTG GAG
CAC ATT GAG CTA AAG CTC GTG CAT CAC ATG GTT CAC AAC
GAA CAA CTG GTC CAA GAC CAG CAG AAG AAG CAC C-3'

were annealed and ligated to form the 5' segment of the IGFBP-3 gene, then cloned into
pUC18 (HinDIII-SalI digest); this construct was designated pYZ37437. The 3' section
of the gene was created by annealing and ligating oligonucleotides

F-1 5'-TCG ACG TGA GAT GGA GGA TAC CTT AAA CCA TTT
AAA ATT TTT GAA CGT TTT ATC CCC GCG TGG CGT TCA TAT
CCC GAA TTG CGA T-3',

F-2 5'-AAA AAA GGC TTC TAC AAA AAG AAA CAA TGC CGT
CCG AGT AAG GGT CGT AAA CGA GGT TTT TGT TGG TGC GTT
GAC AAA TAC GGT-3',

F-3 5'-CAA CCG TTG CCG GGT TAT ACT ACT AAA GGC AAA
GAA GAT GTT CAT TGT TAT TCT ATG CAA TCT AAA TAA TGC
ATC TCG AG-3',

C-1 5'-AAT TCT CGA GAT GCA TTA TTT AGA TTG CAT AGA
ATA ACA ATG AAC ATC TTC TTT GCC TTT AGT AGT ATA ACC
CGG C-3',

C-2 5'-AAC GGT TGA CCG TAT TTG TCA ACG CAC CAA CAA
AAA CCT CGT TTA CGA CCC TTA CTC GGA CGG CAT TGT TTC
TTT TTG TAG AAG-3',

and

C-3 5'-CCT TTT TTA TCG CAA TTC GGG ATA TGA ACG CCA
CGC GGG GAT AAA ACG TTC AAA AAT TTT AAA TGG TTT AAG
GTA TCC TCC ATC TCA CG-3',

followed by cloning into SalI-EcoRI digested pUC18 (designated pYZ37405).
pYZ374100 contained the middle segment of the IGFBP-3 gene and was created by
annealing and ligating oligonucleotides

MF1 5'-CGC GCT TAT TTA TTA CCT GCC CCA CCG GCA CCG
GGT AAC GCC TCC GAA A-3',

MF2 5'-GCG AAG AGG ATC GTT CTG CGG GTT CCG TTG AAT
CTC CAA GTG TGA GTT CTA CCC ATC GAG TTA GCG ACC CGA
AA-3',

MF3 5'-TTT CAT CCG TTG CAC TCT AAA ATC ATT ATT ATT
AAA AAG GGT CAC GCA AAG GAT TCT CAA CGT TAT AAG
GT-3',

MF4 5'-GGA TTA TGA AAG CCA ATC TAC CGA CAC TCA AAA
TTT TAG TAG TGA AAG TAA ACG TGA AAC CGA GTA CGG CCC
GTG-3',

MB1 5'-TCG ACA CGG GCC GTA CTC GGT TTC ACG TTT ACT
TTC ACT ACT AA-3',

MB2 5'-AAT TTT GAG TGT CGG TAG ATT GGC TTT CAT AAT
CCA CCT TAT AAC GTT GAG AAT CCT TTG CGT GAC CCT TTT
T-3',

MB3 5'-AAT AAT AAT GAT TTT AGA GTG CAA CGG ATG AAA
TTT CGG GTC GCT AAC TCG ATG GGT AGA ACT CAC ACT TGG
AGA TT-3',

MB4 5'-CAA CGG AAC CCG CAG AAC GAT CCT CTT CGC TTT
CGG AGG CGT TAC CCG GTG CCG GTG GGG CAG GTA ATA
AAT AAG-3',

digesting the ligated DNA with BssHII and SalI, end filling with Klenow, then cloning
into Klenow-filled, XbaI-digested pUC18.

PCR amplification of a segment of pYZ37490 was used to add a SacII site
and repair a cloning artifact. Primer pairs

pFl 5'-GGT TGT TGT TTA ACT TGT GCT TTA TCT GAA GGT 
CAA CCA TGT GGT ATT TAT ACT GAA CGT TGC GGT AGT GGT 
TTG CGT TGT CAA CCA AGC CCA GAT GAA GCT AGG-3' 

1233 5'-AGCGGATAACAATTTCACACAGGA-3' 



pRl 5'-TAA AGC ACA AGT TAA ACA ACA ACC ACA ACC TGG 
TTC TCG AAC TAA TTC AGC ACA AAC AGC TGG TGG TGG AGC 
ACA TTG AGC TAA AGC TCG TGC ATC ACA TGG T-3' 
1224 5'-CGC CAG GGT TTT CCC AGT CAC GAC-3'

were used to introduce the restriction site and repair the defect. The two PCR amplified
fragments were then mixed and amplified to form a single DNA using primer pair 1233 x
1224. The resulting DNA segment was cloned into HinDIII-SalI digested pYZ37437,

creating pYZ37490. 
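
Joining two PCR fragments in a second round of amplification relies on a region shared by both fragments. As a hedged illustration, the Python sketch below takes the pF1 and pR1 sequences as printed above and computes the stretch over which the reverse complement of pR1 overlaps the 5' end of pF1. The helper names (`clean`, `reverse_complement`, `shared_overlap`) are hypothetical, and the calculation assumes the printed primer sequences are free of transcription errors.

```python
# Illustrative sketch only: helper names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq: str) -> str:
    """Strip whitespace from a printed sequence and upper-case it."""
    return "".join(seq.split()).upper()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def shared_overlap(left: str, right: str) -> str:
    """Return the longest suffix of `left` that is also a prefix of `right`."""
    for size in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:size]):
            return right[:size]
    return ""


PF1 = clean("""
    GGT TGT TGT TTA ACT TGT GCT TTA TCT GAA GGT
    CAA CCA TGT GGT ATT TAT ACT GAA CGT TGC GGT AGT GGT
    TTG CGT TGT CAA CCA AGC CCA GAT GAA GCT AGG
""")
PR1 = clean("""
    TAA AGC ACA AGT TAA ACA ACA ACC ACA ACC TGG
    TTC TCG AAC TAA TTC AGC ACA AAC AGC TGG TGG TGG AGC
    ACA TTG AGC TAA AGC TCG TGC ATC ACA TGG T
""")

overlap = shared_overlap(reverse_complement(PR1), PF1)
print(len(overlap), overlap)
```

For the sequences as printed, the shared region is 24 bases long (GGTTGTTGTTTAACTTGTGCTTTA), which would allow the two first-round products to prime on one another before amplification with the outer primers 1233 and 1224.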

PCR amplification was also used to introduce an additional SacII site, to
facilitate later cloning steps. Primers

5pMP 5'-GAC TGC AAG CTT CCG CGG TGG TGG TGC TTC TTC
TGC TGG TCT TGG A-3'

and

1233 5'-AGCGGATAACAATTTCACACAGGA-3'

were used to amplify a segment of pYZ37490, which was then ligated into HinDIII-SalI
digested pYZ37490, forming pYZ42519.

The IGFBP-3 gene was assembled from the three segments in a three-way
ligation reaction. pYZ42519 (HinDIII-BssHII digest), pYZ374100 (BssHII-SalI digest)
and pYZ37405 (SalI-EcoRI digest) were ligated into HinDIII-EcoRI digested pUC18. A
properly assembled clone was identified by restriction mapping and sequencing. 
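
The directionality of the three-way ligation follows from the restriction sites at the ends of each segment. The Python sketch below is a minimal model of that reasoning, under the simplifying assumption that two ends join only when they were generated by the same enzyme; the function name `can_assemble` and the tuple representation are hypothetical.

```python
# Illustrative sketch: ends are modelled by enzyme name only, and two ends are
# assumed to be ligatable only if they were cut by the same enzyme.
def can_assemble(vector_ends, inserts):
    """Return True if the inserts, in the given order, bridge the vector ends."""
    left, right = vector_ends
    if not inserts:
        return False
    ends_match = inserts[0][0] == left and inserts[-1][1] == right
    junctions_match = all(a[1] == b[0] for a, b in zip(inserts, inserts[1:]))
    return ends_match and junctions_match


vector = ("HinDIII", "EcoRI")  # HinDIII-EcoRI digested pUC18
segments = [
    ("HinDIII", "BssHII"),  # 5' segment from pYZ42519
    ("BssHII", "SalI"),     # middle segment from pYZ374100
    ("SalI", "EcoRI"),      # 3' segment from pYZ37405
]

print(can_assemble(vector, segments))
print(can_assemble(vector, [segments[1], segments[0], segments[2]]))
```

Only the 5', middle, 3' order closes the vector in this model, which is consistent with the need to confirm a properly assembled clone by restriction mapping and sequencing.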

Cloning artifacts were repaired using PCR. BPFIX1 was created by
amplifying pYZ42509 with primers

YZM1 5'-CTC GAT TGC GCG CTT ATT TAT TAC C-3'

and

YZM2 5'-TCT CAC GTC GAC ACG GGC CGT ACT CGG TTT CAC
GTT TAC TCA GTA CTA AAA T-3',

and cloning the resulting fragment (BssHII-SalI digested) into a BssHII-SalI digest of
pYZ42509. A HinDIII-BssHII digest of BPFIX1 was ligated with a HinDIII-BssHII
digest of pYZ42519 to create pYZ42529. A second repair was made using primer pairs

715F' 5'-TGT TGG TGC GTC GAC AAA TAC GGT C-3'

1233 5'-AGCGGATAACAATTTCACACAGGA-3'

and

715R' 5'-GACCGTATTTGTCGACGCACCAACA-3'

1224 5'-CGC CAG GGT TTT CCC AGT CAC GAC-3'.

This repaired a cloning defect and added a SalI site. Two DNA fragments were
amplified from pYZ42529 using primer pairs 715F' x 1233 and 715R' x 1224. These
two fragments were mixed and PCR amplified into a single DNA fragment using 1233 x
1224. This single fragment was digested with BssHII and SalI, then ligated into a
BssHII-SalI digest of pYZ42529, creating pYZ50559.

pYZ42580, the donor construct for the IGFBP-3 gene, was created by
ligation of EcoRI-SacII fragments from pYZ50559 and pDM25497.

The chromosomal transfer DNA carrying the DsbA::2A::IGFBP-3 fusion
gene was transfected into E. coli strain B1384, which was grown in the presence of 100
µM IAA to induce the expression of INT and the integration of the chromosomal transfer
DNA. Integrants were selected with kanamycin. All isolates were also ampicillin
sensitive.

Isolates were further characterized by diagnostic PCR amplification of the
host cell chromosome. PCR amplification with primer pairs

1227 5'-TAA TAC GAC TCA CTA TAG GGA GA-3'

BP3-607 5'-GGG ATA TGA ACG CCA CGC GGG GAT AA-3',

INT107 5'-GCG GAG AAA CCA TAA TTG CAT CTA CTC-3'

BP3-559 5'-CGT GAA ACC GAG TAC GGC CCG TGT C-3',

and

T7REV 5'-TGC TAG TTA TTG CTC AGC GG-3'

TRPBR2 5'-AAG GGC TTC ATC ATC GGT AAT AGA CA-3'

confirmed the proper integration of the intact chromosomal transfer DNA into the
chromosome at the att site.

P1 lysates were prepared from a single isolate and used to transduce
W3110DE3 to kanamycin resistance (as described in Example 1). Kanamycin resistant
isolates were assayed for T7 RNA polymerase activity by streaking against phage 4107,
as described in Example 2. Isolates with T7 RNA polymerase activity were then tested
for expression of the fusion gene by induction with IPTG, followed by analysis of protein
expression by SDS-PAGE of whole cell lysates on 12.5% polyacrylamide gels.
Densitometric analysis of whole cell lysates indicated that the DsbA::2A::IGFBP-3
fusion protein accumulated to a level of 22.6% of total cell protein (Figure 25A).

Example 5

Production of IGF-I fusion proteins

Fusion proteins including DsbA and IGF-I linked by a sequence including
a site for either human rhinovirus 2A or 3C protease were produced using the double
cassette binary system. Construction of binary plasmids and chromosomal transfer
DNAs is diagrammed in Figures 19 and 20.

For expression of DsbA::2A::IGF-I, EcoRI/XbaI digest fragments of
pPO53096 and pP0572U were ligated to form the chromosomal transfer DNA
(CTD-DsbA::2A::IGF-I). EcoRI/XbaI fragments of pPO53097 and pPO57210 were
ligated to form a chromosomal transfer DNA carrying a gene encoding DsbA::3C::IGF-I
(CTD-DsbA::3C::IGF-I). CTD-DsbA::3C::IGF-I and CTD-DsbA::2A::IGF-I were each
transformed into B1384 cells in the presence of indole acrylic acid (to induce INT
expression). Transformants were grown on media containing kanamycin to select for
integrants. Nine individual kan^r colonies from each transformation were tested for
ampicillin sensitivity. All tested colonies were ampicillin sensitive.

Isolates were tested for correctly integrated DNA by PCR amplification
with primer pairs T7F1 x IGFREV and T7REV x TRPBR2 to confirm the presence of the
intact fusion gene and integration into the att site of B1384, as described in Example 2.

P1 lysates were prepared from one of the B1384 integrants from each
transformation and used to transduce W3110DE3 to kanamycin resistance. Kan^r/Gal+
isolates were tested for the presence of T7 RNA polymerase activity as described in
Example 2. Isolates positive for T7 RNA polymerase activity were further tested by
PCR using primer pairs T7F1 x IGFREV and ATT3 x T7RNAP1 to confirm appropriate
integration of the intact fusion gene, as described in Example 2.

Two isolates from each transduction (c57265#44 and c57265#54 for
DsbA::2A::IGF-I; c57264#5 and c57264#28 for DsbA::3C::IGF-I) were then grown on
medium containing both kanamycin and tetracycline. Both CTD-DsbA::3C::IGF-I and
CTD-DsbA::2A::IGF-I carry a tetracycline resistance allele which confers resistance
when the gene is in high copy number. Growth in the presence of tetracycline selects for
amplification of the integrated DNA. Both isolates from each transduction were kan^r/tet^r.
The isolates were then tested for expression of the fusion proteins by induction with
IPTG. Protein expression was assayed by SDS-PAGE of whole cell lysates.
Densitometric scanning of an SDS-PAGE gel showed that the two isolates expressing
DsbA::3C::IGF-I fusion protein accumulated the fusion protein to 20% and 20.1% of
total cell protein and the two isolates expressing DsbA::2A::IGF-I accumulated the
fusion protein to 25.7% and 38% of total cell protein. 
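
The percentages reported from densitometry express the signal of the fusion protein band as a fraction of the total signal in the lane. The Python sketch below shows that arithmetic; the function name `percent_of_total` and the signal values are hypothetical and were chosen only to illustrate the calculation, not taken from the gels described here.

```python
# Illustrative sketch: the signal values are hypothetical, in arbitrary
# densitometer units, and do not come from the gels described in the text.
def percent_of_total(band_signal: float, lane_signal: float) -> float:
    """Return the band signal as a percentage of the total lane signal."""
    if lane_signal <= 0:
        raise ValueError("lane signal must be positive")
    if not 0 <= band_signal <= lane_signal:
        raise ValueError("band signal must lie between 0 and the lane signal")
    return 100.0 * band_signal / lane_signal


lane_total = 1000.0   # hypothetical integrated signal of the whole lane
fusion_band = 257.0   # hypothetical integrated signal of the fusion band

print(round(percent_of_total(fusion_band, lane_total), 1))
```

A band carrying 257 of 1000 signal units would be reported as 25.7% of total cell protein under this convention.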

Example 6 

Chromosomal expression of TGF-β2 using the double cassette binary system

A chromosomal transfer DNA encoding a fusion protein comprising
DsbA, ubiquitin, and human TGF-β2 (DsbA::ubiquitin::TGF-β2) was created using the
double cassette method. Construction of the fusion gene and chromosomal transfer DNA
are shown in Figure 21. DsbA::ubiquitin was from pDM25497, and TGF-β2 was PCR
amplified from pPC-21 (Madisen et al. (1988) DNA 7:1-8) using primers

UBTGFβ2F 5'-GGG GCC GCG GTG GTG CTT TGG ATG CGG CCT
ATT GCT TTA GA-3'

and

TGFβ2R 5'-GGG GAA TTC TTA GCT GCA TTT GCA AGA CTT TAC
A-3'.

pDM25497 was digested with SacII-EcoRI and the 4.3 kb fragment
containing pUC18 and DsbA::ubiquitin sequences was isolated. The 0.35 kb PCR
product resulting from the amplification of pPC-21 encoding the last 112 amino acids of
human TGF-β2 was purified and digested with SacII-EcoRI. These two fragments were
ligated to create pDP26, a pUC18 derivative containing a DsbA::ubiquitin::TGF-β2
fusion gene. pDP26 was the donor construct for assembly of the binary plasmids used to
make the chromosomal transfer DNA.

The fusion gene from pDP26 was ligated into the double-cassette binary
vectors pDM25470 and pDM25465 to create pC9DP and pA6DP, respectively. Briefly,
pDM25470 was digested with BamHI-SmaI and the 4.2 kb fragment was isolated.
pDP26 was digested with EcoRI, blunt ended with the Klenow fragment of DNA
polymerase, and then digested with BamHI. The 1.1 kb fragment from this digest was
isolated. The two fragments described above were ligated to create pC9DP.

pDM25465 was digested with BamHI, blunt ended with Klenow, digested
with XbaI and the 7.1 kb fragment was isolated. pDP26 was digested with EcoRI, blunt
ended with Klenow, digested with BamHI, and the 1.2 kb fragment was isolated. This 1.2
kb fragment was ligated to the 7.1 kb fragment from pDM25465 to create pA6DP.

An additional binary plasmid containing a tetracycline resistance
selectable marker was created using the 7.2 kb fragment isolated from pDM46932
following XbaI-XhoI digestion, and the fusion gene from pA6DP (2.7 kb XbaI-XhoI
fragment). These two fragments were ligated to create pA6DPnT. EcoRI-XbaI fragments
of pC9DP (2.2 kb) and pA6DPnT (6.4 kb) were ligated to form the chromosomal
transfer DNA (Figure 21).

The chromosomal transfer DNA was transformed into E. coli strain
B1384, which was grown in the presence of 500 µM IAA to induce the expression of INT
and the integration of the chromosomal transfer DNA. Integrants were selected with 10
µg/ml kanamycin. All isolates were found to be ampicillin sensitive.

Isolates were further characterized by diagnostic PCR amplification of
host cell chromosomal DNA. PCR amplifications with primer pairs

1227 5'-TAA TAC GAC TCA CTA TAG GGA GA-3'
P21079 5'-GGA AAT GGA TAG AGG AAG GG-3',

and

INT107 5'-GCG GAG AAA CCA TAA TTG CAT CTA CTC-3'

6HEP2 5'-GGG GGA TCC GAT GOT GGA GGA TGA TTA AAT GGA
GGA GGA CCA CCA CCA CGA GGA GGA GAA AGG TTT GGA
TGG GGC GTA T-3'

and primers T7REV and TRPBR2, described previously (see Example 2), confirmed the
proper integration of the intact chromosomal transfer DNA into the chromosome at the
att site.

P1 lysates were prepared from a single isolate and used to transduce
W3110DE3 to kanamycin resistance, as described previously. Amplification of the
integrated fusion gene was accomplished by growth of kanamycin resistant isolates on
medium containing kanamycin and tetracycline (30 µg/ml). Kanamycin/tetracycline
resistant isolates were assayed for T7 RNA polymerase activity by streaking against
phage 4107, as described in Example 2. Isolates with T7 RNA polymerase activity were
then tested for expression of the fusion gene by induction with IPTG, followed by
analysis of protein expression by SDS-PAGE of whole cell lysates on 10%
polyacrylamide gels (Figure 26). Protein accumulation in chromosomal integrants was
comparable to the levels seen in host cells containing a multicopy number plasmid
utilizing the same T7 promoter linked to a copy of the gene encoding the
DsbA::ubiquitin::TGF-β2 fusion protein. Densitometric analysis showed that protein
accumulation in chromosomal integrants was as high as 36.7% of total cell protein. 

Example 7 

Expression of a heterologous protein using a promoter-less CTD

This example shows the use of a chromosomal transfer DNA which does 
not carry a promoter. The chromosomal transfer DNA carries a segment of DNA 
homologous to a bacterial gene (in this example, lacZ or DsbA) linked in-frame to a 
DNA sequence encoding a heterologous protein of interest (in this case the
DsbA::3C::IGF-I fusion protein of Example 5), as well as selectable marker genes. The
homologous DNA encodes the 5' region of the bacterial gene. The chromosomal transfer
DNA is introduced into the host cell, where it integrates into the homologous gene on the
chromosome of the host cell, forming an operable linkage between the homologous
gene's promoter and the DNA sequence encoding the heterologous protein of interest.
Integrants are selected for using the selectable markers carried on the chromosomal
transfer DNA. The heterologous protein of interest is expressed through the homologous
gene's promoter (Figure 8). 

The DNA encoding the DsbA::3C::IGF-I fusion protein is constructed as 
described in Example 5. This fusion gene is then placed in frame to a DNA segment 
encoding the first 100 amino-terminal amino acids of the lacZ gene, forming a 
lacZ/DsbA::3C::IGF-I gene. The cyclophilin, kanamycin resistance, and tetracycline
resistance genes utilized in Example 5 are also cloned into the plasmid carrying the
lacZ/DsbA::3C::IGF-I gene. This plasmid is then cleaved with restriction endonucleases
to remove the plasmid origin of replication, the ampicillin resistance gene and other
non-essential sequences, then re-ligated to form a circular chromosomal transfer DNA.
The chromosomal transfer DNA is transformed into E. coli host cells and the
transformed host cells are grown on media containing kanamycin (10 µg/ml).
Kanamycin resistant isolates are tested for ampicillin sensitivity, to show that the host
cells carry integrated DNA, not plasmid DNA. Kan^r isolates are also tested using PCR.
PCR primers from the lacZ promoter and the DNA sequence encoding DsbA are used to
confirm integration of the intact chromosomal transfer DNA. Amplification of the
integrated fusion gene is selected for by growth of kan^r isolates on medium containing
kanamycin and tetracycline (30 µg/ml). Expression of the integrated fusion gene is
induced by growth of kan^r/tet^r isolates in the presence of IPTG. Protein expression is
assayed by SDS-PAGE. 

Promoter-less chromosomal transfer DNAs may also be integrated into 
other sites on the host cell chromosome (Figure 9). Specialized host cells may be 
constructed which carry a chromosomal copy of an inducible promoter (in this case the 
T7 promoter) linked to a particular gene (in this case DsbA). This host cell is made by
transforming a variant chromosomal transfer DNA (carrying a copy of the lacZ gene, the
T7 promoter operably linked to the 5' end of the DsbA gene and the chloramphenicol
resistance gene) into the host cell (in this case W3110DE3, which also carries a copy of
the gene encoding T7 RNA polymerase). Integration of the chromosomal transfer DNA
is selected for by growth of transformed W3110DE3 cells on medium containing
chloramphenicol. The integration of the chromosomal transfer DNA produces a
W3110DE3 host cell containing the T7 promoter linked to the 5' portion of the DsbA
gene. This integrated DNA then becomes the target for integration of a chromosomal
transfer DNA carrying a DNA sequence encoding the heterologous protein of interest. 

A chromosomal transfer DNA carrying the DNA sequence encoding the 
heterologous protein of interest is constructed (in this case the DsbA::3C::IGF-I fusion 
gene described above and in Example 5). The cyclophilin, kanamycin resistance and 
tetracycline resistance genes are also cloned onto the plasmid. This plasmid is then 
cleaved with the appropriate restriction enzymes to remove the plasmid origin of
replication, ampicillin resistance gene, and other non-essential sequences, and re-ligated
to form a circular chromosomal transfer DNA. The chromosomal transfer DNA is
transformed into the T7-DsbA W3110DE3 host cells described above. Integrants are
selected by growth on medium containing chloramphenicol and kanamycin. Kan^r/cam^r
isolates are checked for integration of the intact chromosomal transfer DNA by PCR.
PCR amplification of host cell chromosomal DNA using primer pair T7F1 x IGFREV
confirms the integration of the intact chromosomal transfer DNA. Integrants are checked
for T7 RNA polymerase activity by streaking against phage 4107, as described in
Example 2. Amplification of the integrated DNA is selected for by growth of T7 RNA
polymerase-positive isolates on kanamycin, chloramphenicol, and tetracycline. Resistant
isolates are assayed for protein expression by induction with IPTG. Protein expression is 
assayed by SDS-PAGE. 

Example 8 

Expression of a DsbA::3C::IGFBP-3 fusion protein using the double cassette system

A gene encoding DsbA::3C::IGFBP-3 fusion protein was expressed using
the double cassette binary system shown in Figure 7. The DsbA sequence was originally
isolated by PCR amplification of the DsbA gene from the E. coli chromosome; plasmid
pDM25454 was used as the source of the DsbA sequence for this fusion gene. The site
for 3C protease was created by synthesizing two oligonucleotides,

RV3CTA 5'-CCCGATTCTCTGGAAGTTCTGTTCCAA-3'

and

RV3CTB 5'-TTGGAACAGAACTTCCAGAGAATCGGGCATG-3',

which were annealed to form a double stranded DNA fragment encoding a 3C protease
cleavage site. The IGFBP-3 gene was constructed by annealing and ligating synthetic
oligonucleotides, as described in Example 4. The IGFBP-3 sequence used for
construction of the gene encoding the DsbA::3C::IGFBP-3 fusion protein was a PCR
amplified DNA fragment made using primers BP3RZ and NBP3F and template
pYZ42580. Cloning of the two DNA sources used to make chromosomal transfer DNA
carrying the gene encoding the DsbA::3C::IGFBP-3 fusion protein, pDM46947 and
pDM46948, is shown in Figure 22. 
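
The coding content of the annealed RV3CTA/RV3CTB fragment can be examined with a short translation. The Python sketch below translates RV3CTA using the standard genetic code and checks how RV3CTB pairs with it. It assumes, for illustration only, that the reading frame begins at the first base of RV3CTA; the helper names are hypothetical.

```python
# Illustrative sketch: assumes the reading frame starts at the first base of
# RV3CTA; helper names are hypothetical.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons of `seq`, starting at its first base."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


RV3CTA = "CCCGATTCTCTGGAAGTTCTGTTCCAA"
RV3CTB = "TTGGAACAGAACTTCCAGAGAATCGGGCATG"

print(translate(RV3CTA))
print(reverse_complement(RV3CTA) == RV3CTB[:len(RV3CTA)], RV3CTB[len(RV3CTA):])
```

Under the stated frame assumption the translation is PDSLEVLFQ, which contains the Leu-Glu-Val-Leu-Phe-Gln motif commonly associated with 3C protease recognition. RV3CTB is the exact reverse complement of RV3CTA followed by four extra bases (CATG), so the annealed fragment is blunt at one end and carries a four-base single-stranded extension at the other.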

The chromosomal transfer DNA was constructed using EcoRI/XbaI
fragments from pDM46947 and pDM46948. The chromosomal transfer DNA was
transformed into B1384 cells grown in the presence of indole acrylic acid (to induce the
expression of INT). Integrants were selected for by growth of transformants on media
containing kanamycin. All kanamycin resistant isolates were also ampicillin sensitive.
Kanamycin resistant isolates were checked by PCR using primer pairs 1227 x BP3-607,
INT107 x BP3-559, and T7REV x TRPBR2, as described in Example 4.

P1 lysates were prepared from one of the kanamycin resistant isolates and
used to transduce W3110DE3 to kanamycin resistance. Kanamycin resistant transductants were
tested for the presence of T7 RNA polymerase activity by streaking against phage 4107,
as described in Example 2. Kanamycin resistant/T7 RNA polymerase positive isolates
were selected for chromosomal amplification by growth on media containing kanamycin
and tetracycline. One kan^r/tet^r isolate was selected and checked for protein expression by
induction with IPTG. Protein accumulation was assayed by SDS-PAGE (Figure 25B).
Densitometric analysis of an SDS-PAGE gel showed that the DsbA::3C::IGFBP-3 fusion
protein accumulated to an average of 27.4% of total cell protein.

All publications, patents and patent applications cited in this specification 
are incorporated herein by reference to the same extent as if each individual publication,
patent, or patent application was specifically and individually indicated to be 
incorporated by reference. 

It should be apparent that one having ordinary skill in the art would be 
able to surmise equivalents to the claimed invention which would be within the spirit of 
the description above. Those equivalents are to be included within the scope of the 
present invention. 



WO 96/40722    PCT/US96/09006 

What is claimed is: 



1. A method for producing a heterologous protein of interest, comprising 
the steps of: 

transferring a chromosomal transfer DNA into a bacterial host cell, 
wherein said chromosomal transfer DNA comprising at least one copy of 
a gene encoding the heterologous protein of interest and a selectable marker, 
and wherein said host cell comprising a chromosome; 

selecting for integration of said chromosomal transfer DNA into said host 
cell chromosome resulting in a host cell chromosome comprising a gene encoding a 
heterologous protein of interest operably linked to a promoter functional in the host cell 
and a selectable marker flanked by duplicate DNA; and 

expressing said gene, 

wherein said gene is at no time operably linked to a promoter functional in 
a host cell on a multicopy number plasmid vector during construction of the transfer 
DNA and 

wherein said non-bacterial protein of interest accumulates within said host 
cell to a level in excess of 0.1% of total cell protein. 

2. The method of claim 1 wherein said chromosomal transfer DNA 
further comprises a promoter functional in said host cell, said promoter being operably 
linked to said gene encoding the heterologous protein of interest, and 

3. The method of claim 1 wherein said host cell chromosome further 
comprises a host cell promoter and said chromosomal transfer DNA further comprises a 
DNA sequence homologous to a segment of the bacterial chromosome, downstream from 
said host cell promoter, said DNA sequence linked in-frame to said gene encoding the 
heterologous protein of interest, 

wherein integration of said chromosomal transfer DNA results in the 
formation of an operable linkage between said DNA sequence and the host cell promoter. 

4. The method of claim 1 wherein said non-bacterial protein of interest 
accumulates within said host cell to a level in excess of 1% of total cell protein. 

5. The method of claim 1 wherein said heterologous protein of interest is 
a eukaryotic protein. 

6. The method of claim 1 wherein said heterologous protein of interest is 
a mammalian protein. 



7. The method of claim 1, wherein each said duplicate DNA comprises 
said gene encoding a heterologous protein of interest linked to said promoter. 

8. The method of claim 1 further comprising selecting for chromosomal 
amplification of said chromosomal transfer DNA following integration of said 
chromosomal transfer DNA into the chromosome of said host cell. 

9. A method for producing a chromosomal transfer DNA, comprising: 

ligating a restriction fragment from each of a first plasmid vector and a 
second plasmid vector thereby producing said chromosomal transfer DNA, 

said first vector comprising a gene encoding a heterologous protein of 
interest lacking an operably linked promoter, 

said second vector comprising a promoter functional in a host cell, 

wherein said chromosomal transfer DNA comprises a selectable marker, 
said gene encoding a heterologous protein of interest operably linked to said promoter 
and duplicate DNA flanking said gene and lacks an origin of replication operable in said 
host cell. 



10. A method for producing a chromosomal transfer DNA, comprising: 

ligating a restriction fragment from each of a first plasmid vector and a 
second plasmid vector thereby producing a chromosomal transfer DNA, 

said first plasmid comprising a first gene encoding a heterologous protein 
of interest and a first promoter functional in a host cell, and wherein said first gene and 
first promoter are not operably linked, 

said second vector comprising a second gene encoding a heterologous 
protein of interest lacking an operably linked promoter, and a second promoter functional 
in a host cell, 

wherein said chromosomal transfer DNA comprises a selectable marker 
and lacks an origin of replication operable in said host cell and wherein said first gene is 
operably linked to said second promoter on the chromosomal transfer DNA and said 
second gene is operably linked to said first promoter on said chromosomal transfer DNA. 

11. A chromosomal transfer DNA, comprising: 

a gene encoding a heterologous protein of interest operably linked to a 
promoter functional in a host cell; and 

a selectable marker, said selectable marker flanked by duplicate DNA, 

wherein said gene encoding a heterologous protein of interest is at no time 
operably linked to a promoter functional in a host cell on a multicopy number plasmid 
vector. 

12. A chromosomal transfer DNA, comprising: 

two copies of a gene encoding a heterologous protein of interest, each of 
said copies being operably linked to a promoter functional in a host cell; and 

a selectable marker, said selectable marker flanked by said copies of said 
gene encoding a heterologous protein of interest, 

wherein each of said copies of said gene are at no time operably linked to 
a promoter functional in a host cell on a multicopy number plasmid vector. 






AMENDED CLAIMS 

[received by the International Bureau on 10 September 1996 (10.09.96); 
original claims 1-12 replaced by new claims 1-12 (3 pages)] 

1. A method for producing a heterologous protein of interest, comprising the steps of: 

transferring a chromosomal transfer DNA into a bacterial host cell, wherein said 
chromosomal transfer DNA comprises at least one copy of a gene encoding the heterologous 
protein of interest and a selectable marker, and wherein said host cell comprises a chromosome; 

selecting for integration of said chromosomal transfer DNA into said cell 
chromosome resulting in a host cell chromosome comprising a gene encoding a heterologous 
protein of interest operably linked to a promoter functional in the host cell and a selectable 
marker flanked by duplicate DNA; and 

expressing said gene, wherein said gene is at no time operably linked to a 
promoter functional in a host cell on a multicopy number plasmid vector during construction of 
the transfer DNA and wherein said heterologous protein of interest accumulates within said host 
cell to a level in excess of 0.1% of total cell protein. 

2. The method of claim 1 wherein said chromosomal transfer DNA further comprises a 
promoter functional in said host cell, said promoter being operably linked to said gene encoding 
the heterologous protein of interest, and wherein the operable linkage is created by 
circularization of the chromosomal transfer DNA. 

3. The method of claim 1 wherein said host cell chromosome further comprises a host 
cell promoter and said chromosomal transfer DNA further comprises a DNA sequence 
homologous to a segment of the host cell chromosome downstream from said host cell promoter, 
said DNA sequence linked in-frame to said gene encoding the heterologous protein of interest, 
wherein integration of said chromosomal transfer DNA results in the formation of an operable 
linkage between said DNA sequence and the host cell promoter. 

4. The method of claim 1 wherein said heterologous protein of interest accumulates 
within said host cell to a level in excess of 1% of total cell protein. 

5. The method of claim 1 wherein said heterologous protein of interest is a eukaryotic 
protein. 

6. The method of claim 1 wherein said heterologous protein of interest is a mammalian 
protein. 

7. The method of claim 1 wherein each said duplicate DNA comprises said gene 
encoding a heterologous protein of interest linked to said promoter. 

8. The method of claim 1 further comprising selecting for chromosomal amplification of 
said chromosomal transfer DNA following integration of said chromosomal transfer DNA into 
the chromosome of said host cell. 

9. A method for producing a chromosomal transfer DNA comprising: 

ligating a restriction fragment from each of a first plasmid vector and a second plasmid 
vector thereby producing said chromosomal transfer DNA, said first vector comprising a gene 
encoding a heterologous protein of interest lacking an operably linked promoter, said second 
vector comprising a promoter functional in a host cell, wherein said chromosomal transfer DNA 
comprises a selectable marker, said gene encoding a heterologous protein of interest operably 
linked to said promoter and duplicate DNA flanking said gene and lacks an origin of replication 
operable in said host cell. 

10. A method for producing chromosomal transfer DNA comprising: 

ligating a restriction fragment from each of a first plasmid vector and a second plasmid 
vector thereby producing a chromosomal transfer DNA, said first plasmid comprising a first gene 
encoding a heterologous protein of interest and a first promoter functional in a host cell, and 
wherein said first gene and first promoter are not operably linked, said second vector comprising 
a second gene encoding a heterologous protein of interest lacking an operably linked promoter, 
and a second promoter functional in a host cell, wherein said chromosomal transfer DNA 
comprises a selectable marker and lacks an origin of replication in said host cell and wherein said 
first gene is operably linked to said second promoter on the chromosomal transfer DNA and said 
second gene is operably linked to said first promoter on said chromosomal transfer DNA. 

11. A chromosomal transfer DNA comprising: 

a gene encoding a heterologous protein of interest operably linked to a promoter 
functional in a host cell; and 

a selectable marker, said selectable marker flanked by duplicate DNA, wherein said gene 
encoding a heterologous protein of interest is at no time operably linked to a promoter functional 
in a host cell on a multicopy number plasmid vector. 

12. A chromosomal transfer DNA comprising: 

two copies of a gene encoding a heterologous protein of interest, each of said copies 
being operably linked to a promoter functional in a host cell; and 

a selectable marker, said selectable marker flanked by said copies of said gene encoding a 
heterologous protein of interest, wherein each of said copies of said gene are at no time operably 
linked to a promoter functional in a host cell on a multicopy number plasmid vector. 